Cargando…
Testing for differences in polygenic scores in the presence of confounding
Polygenic scores have become an important tool in human genetics, enabling the prediction of individuals’ phenotypes from their genotypes. Understanding how the pattern of differences in polygenic score predictions across individuals intersects with variation in ancestry can provide insights into th...
Autores principales: | , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Cold Spring Harbor Laboratory
2023
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10055004/ https://www.ncbi.nlm.nih.gov/pubmed/36993707 http://dx.doi.org/10.1101/2023.03.12.532301 |
_version_ | 1785015802675593216 |
---|---|
author | Blanc, Jennifer Berg, Jeremy J. |
author_facet | Blanc, Jennifer Berg, Jeremy J. |
author_sort | Blanc, Jennifer |
collection | PubMed |
description | Polygenic scores have become an important tool in human genetics, enabling the prediction of individuals’ phenotypes from their genotypes. Understanding how the pattern of differences in polygenic score predictions across individuals intersects with variation in ancestry can provide insights into the evolutionary forces acting on the trait in question, and is important for understanding health disparities. However, because most polygenic scores are computed using effect estimates from population samples, they are susceptible to confounding by both genetic and environmental effects that are correlated with ancestry. The extent to which this confounding drives patterns in the distribution of polygenic scores depends on patterns of population structure in both the original estimation panel and in the prediction/test panel. Here, we use theory from population and statistical genetics, together with simulations, to study the procedure of testing for an association between polygenic scores and axes of ancestry variation in the presence of confounding. We use a general model of genetic relatedness to describe how confounding in the estimation panel biases the distribution of polygenic scores in a way that depends on the degree of overlap in population structure between panels. We then show how this confounding can bias tests for associations between polygenic scores and important axes of ancestry variation in the test panel. Finally, we use the understanding gained from this analysis to develop a method that uses patterns of genetic similarity between the two panels to guard against these biases, and show that this method can provide better protection against confounding than the standard PCA-based approach. |
format | Online Article Text |
id | pubmed-10055004 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2023 |
publisher | Cold Spring Harbor Laboratory |
record_format | MEDLINE/PubMed |
spelling | pubmed-100550042023-03-30 Testing for differences in polygenic scores in the presence of confounding Blanc, Jennifer Berg, Jeremy J. bioRxiv Article Polygenic scores have become an important tool in human genetics, enabling the prediction of individuals’ phenotypes from their genotypes. Understanding how the pattern of differences in polygenic score predictions across individuals intersects with variation in ancestry can provide insights into the evolutionary forces acting on the trait in question, and is important for understanding health disparities. However, because most polygenic scores are computed using effect estimates from population samples, they are susceptible to confounding by both genetic and environmental effects that are correlated with ancestry. The extent to which this confounding drives patterns in the distribution of polygenic scores depends on patterns of population structure in both the original estimation panel and in the prediction/test panel. Here, we use theory from population and statistical genetics, together with simulations, to study the procedure of testing for an association between polygenic scores and axes of ancestry variation in the presence of confounding. We use a general model of genetic relatedness to describe how confounding in the estimation panel biases the distribution of polygenic scores in a way that depends on the degree of overlap in population structure between panels. We then show how this confounding can bias tests for associations between polygenic scores and important axes of ancestry variation in the test panel. Finally, we use the understanding gained from this analysis to develop a method that uses patterns of genetic similarity between the two panels to guard against these biases, and show that this method can provide better protection against confounding than the standard PCA-based approach. Cold Spring Harbor Laboratory 2023-08-22 /pmc/articles/PMC10055004/ /pubmed/36993707 http://dx.doi.org/10.1101/2023.03.12.532301 Text en https://creativecommons.org/licenses/by/4.0/This work is licensed under a Creative Commons Attribution 4.0 International License (https://creativecommons.org/licenses/by/4.0/) , which allows reusers to distribute, remix, adapt, and build upon the material in any medium or format, so long as attribution is given to the creator. The license allows for commercial use. |
spellingShingle | Article Blanc, Jennifer Berg, Jeremy J. Testing for differences in polygenic scores in the presence of confounding |
title | Testing for differences in polygenic scores in the presence of confounding |
title_full | Testing for differences in polygenic scores in the presence of confounding |
title_fullStr | Testing for differences in polygenic scores in the presence of confounding |
title_full_unstemmed | Testing for differences in polygenic scores in the presence of confounding |
title_short | Testing for differences in polygenic scores in the presence of confounding |
title_sort | testing for differences in polygenic scores in the presence of confounding |
topic | Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10055004/ https://www.ncbi.nlm.nih.gov/pubmed/36993707 http://dx.doi.org/10.1101/2023.03.12.532301 |
work_keys_str_mv | AT blancjennifer testingfordifferencesinpolygenicscoresinthepresenceofconfounding AT bergjeremyj testingfordifferencesinpolygenicscoresinthepresenceofconfounding |