Cargando…

Comparison of Methods Utilizing Sex-Specific PRSs Derived From GWAS Summary Statistics

The polygenic risk score (PRS) is calculated as the weighted sum of an individual’s genotypes and their estimated effect sizes, which is often used to estimate an individual’s genetic susceptibility to complex traits and disorders. It is well known that some complex human traits or disorders have se...

Descripción completa

Detalles Bibliográficos
Autores principales: Zhang, Chi, Ye, Yixuan, Zhao, Hongyu
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Frontiers Media S.A. 2022
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9304553/
https://www.ncbi.nlm.nih.gov/pubmed/35873490
http://dx.doi.org/10.3389/fgene.2022.892950
_version_ 1784752112930914304
author Zhang, Chi
Ye, Yixuan
Zhao, Hongyu
author_facet Zhang, Chi
Ye, Yixuan
Zhao, Hongyu
author_sort Zhang, Chi
collection PubMed
description The polygenic risk score (PRS) is calculated as the weighted sum of an individual’s genotypes and their estimated effect sizes, which is often used to estimate an individual’s genetic susceptibility to complex traits and disorders. It is well known that some complex human traits or disorders have sex differences in trait distributions, disease onset, progression, and treatment response, although the underlying mechanisms causing these sex differences remain largely unknown. PRSs for these traits are often based on Genome-Wide Association Studies (GWAS) data with both male and female samples included, ignoring sex differences. In this study, we present a benchmark study using both simulations with various combinations of genetic correlation and sample size ratios between sexes and real data to investigate whether combining sex-specific PRSs can outperform sex-agnostic PRSs on traits showing sex differences. We consider two types of PRS models in our study: single-population PRS models (PRScs, LDpred2) and multiple-population PRS models (PRScsx). For each trait or disorder, the candidate PRSs were calculated based on sex-specific GWAS data and sex-agnostic GWAS data. The simulation results show that applying LDpred2 or PRScsx to sex-specific GWAS data and then combining sex-specific PRSs leads to the highest prediction accuracy when the genetic correlation between sexes is low and the sample sizes for both sexes are balanced and large. Otherwise, the PRS generated by applying LDpred2 or PRScs to sex-agnostic GWAS data is more appropriate. If the sample sizes between sexes are not too small and very unbalanced, combining LDpred2-based sex-specific PRSs to predict on the sex with a larger sample size and combining PRScsx-based sex-specific PRSs to predict on the sex with a smaller size are the preferred strategies. For real data, we considered 19 traits from Genetic Investigation of ANthropometric Traits (GIANT) consortium studies and UK Biobank with both sex-specific GWAS data and sex-agnostic GWAS data. We found that for waist-to-hip ratio (WHR) related traits, accounting for sex differences and incorporating information from the opposite sex could help improve PRS prediction accuracy. Taken together, our findings in this study provide guidance on how to calculate the best PRS for sex-differentiated traits or disorders, especially as the sample size of GWASs grows in the future.
format Online
Article
Text
id pubmed-9304553
institution National Center for Biotechnology Information
language English
publishDate 2022
publisher Frontiers Media S.A.
record_format MEDLINE/PubMed
spelling pubmed-93045532022-07-23 Comparison of Methods Utilizing Sex-Specific PRSs Derived From GWAS Summary Statistics Zhang, Chi Ye, Yixuan Zhao, Hongyu Front Genet Genetics The polygenic risk score (PRS) is calculated as the weighted sum of an individual’s genotypes and their estimated effect sizes, which is often used to estimate an individual’s genetic susceptibility to complex traits and disorders. It is well known that some complex human traits or disorders have sex differences in trait distributions, disease onset, progression, and treatment response, although the underlying mechanisms causing these sex differences remain largely unknown. PRSs for these traits are often based on Genome-Wide Association Studies (GWAS) data with both male and female samples included, ignoring sex differences. In this study, we present a benchmark study using both simulations with various combinations of genetic correlation and sample size ratios between sexes and real data to investigate whether combining sex-specific PRSs can outperform sex-agnostic PRSs on traits showing sex differences. We consider two types of PRS models in our study: single-population PRS models (PRScs, LDpred2) and multiple-population PRS models (PRScsx). For each trait or disorder, the candidate PRSs were calculated based on sex-specific GWAS data and sex-agnostic GWAS data. The simulation results show that applying LDpred2 or PRScsx to sex-specific GWAS data and then combining sex-specific PRSs leads to the highest prediction accuracy when the genetic correlation between sexes is low and the sample sizes for both sexes are balanced and large. Otherwise, the PRS generated by applying LDpred2 or PRScs to sex-agnostic GWAS data is more appropriate. If the sample sizes between sexes are not too small and very unbalanced, combining LDpred2-based sex-specific PRSs to predict on the sex with a larger sample size and combining PRScsx-based sex-specific PRSs to predict on the sex with a smaller size are the preferred strategies. For real data, we considered 19 traits from Genetic Investigation of ANthropometric Traits (GIANT) consortium studies and UK Biobank with both sex-specific GWAS data and sex-agnostic GWAS data. We found that for waist-to-hip ratio (WHR) related traits, accounting for sex differences and incorporating information from the opposite sex could help improve PRS prediction accuracy. Taken together, our findings in this study provide guidance on how to calculate the best PRS for sex-differentiated traits or disorders, especially as the sample size of GWASs grows in the future. Frontiers Media S.A. 2022-07-08 /pmc/articles/PMC9304553/ /pubmed/35873490 http://dx.doi.org/10.3389/fgene.2022.892950 Text en Copyright © 2022 Zhang, Ye and Zhao. https://creativecommons.org/licenses/by/4.0/This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.
spellingShingle Genetics
Zhang, Chi
Ye, Yixuan
Zhao, Hongyu
Comparison of Methods Utilizing Sex-Specific PRSs Derived From GWAS Summary Statistics
title Comparison of Methods Utilizing Sex-Specific PRSs Derived From GWAS Summary Statistics
title_full Comparison of Methods Utilizing Sex-Specific PRSs Derived From GWAS Summary Statistics
title_fullStr Comparison of Methods Utilizing Sex-Specific PRSs Derived From GWAS Summary Statistics
title_full_unstemmed Comparison of Methods Utilizing Sex-Specific PRSs Derived From GWAS Summary Statistics
title_short Comparison of Methods Utilizing Sex-Specific PRSs Derived From GWAS Summary Statistics
title_sort comparison of methods utilizing sex-specific prss derived from gwas summary statistics
topic Genetics
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9304553/
https://www.ncbi.nlm.nih.gov/pubmed/35873490
http://dx.doi.org/10.3389/fgene.2022.892950
work_keys_str_mv AT zhangchi comparisonofmethodsutilizingsexspecificprssderivedfromgwassummarystatistics
AT yeyixuan comparisonofmethodsutilizingsexspecificprssderivedfromgwassummarystatistics
AT zhaohongyu comparisonofmethodsutilizingsexspecificprssderivedfromgwassummarystatistics