Cargando…

Reconstructing SNP allele and genotype frequencies from GWAS summary statistics

The emergence of genome-wide association studies (GWAS) has led to the creation of large repositories of human genetic variation, creating enormous opportunities for genetic research and worldwide collaboration. Methods that are based on GWAS summary statistics seek to leverage such records, overcom...

Descripción completa

Detalles Bibliográficos
Autores principales: Yang, Zhiyu, Paschou, Peristera, Drineas, Petros
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Nature Publishing Group UK 2022
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9114146/
https://www.ncbi.nlm.nih.gov/pubmed/35581276
http://dx.doi.org/10.1038/s41598-022-12185-6
_version_ 1784709720168202240
author Yang, Zhiyu
Paschou, Peristera
Drineas, Petros
author_facet Yang, Zhiyu
Paschou, Peristera
Drineas, Petros
author_sort Yang, Zhiyu
collection PubMed
description The emergence of genome-wide association studies (GWAS) has led to the creation of large repositories of human genetic variation, creating enormous opportunities for genetic research and worldwide collaboration. Methods that are based on GWAS summary statistics seek to leverage such records, overcoming barriers that often exist in individual-level data access while also offering significant computational savings. Such summary-statistics-based applications include GWAS meta-analysis, with and without sample overlap, and case-case GWAS. We compare performance of leading methods for summary-statistics-based genomic analysis and also introduce a novel framework that can unify usual summary-statistics-based implementations via the reconstruction of allelic and genotypic frequencies and counts (ReACt). First, we evaluate ASSET, METAL, and ReACt using both synthetic and real data for GWAS meta-analysis (with and without sample overlap) and find that, while all three methods are comparable in terms of power and error control, ReACt and METAL are faster than ASSET by a factor of at least hundred. We then proceed to evaluate performance of ReACt vs an existing method for case-case GWAS and show comparable performance, with ReACt requiring minimal underlying assumptions and being more user-friendly. Finally, ReACt allows us to evaluate, for the first time, an implementation for calculating polygenic risk score (PRS) for groups of cases and controls based on summary statistics. Our work demonstrates the power of GWAS summary-statistics-based methodologies and the proposed novel method provides a unifying framework and allows further extension of possibilities for researchers seeking to understand the genetics of complex disease.
format Online
Article
Text
id pubmed-9114146
institution National Center for Biotechnology Information
language English
publishDate 2022
publisher Nature Publishing Group UK
record_format MEDLINE/PubMed
spelling pubmed-91141462022-05-19 Reconstructing SNP allele and genotype frequencies from GWAS summary statistics Yang, Zhiyu Paschou, Peristera Drineas, Petros Sci Rep Article The emergence of genome-wide association studies (GWAS) has led to the creation of large repositories of human genetic variation, creating enormous opportunities for genetic research and worldwide collaboration. Methods that are based on GWAS summary statistics seek to leverage such records, overcoming barriers that often exist in individual-level data access while also offering significant computational savings. Such summary-statistics-based applications include GWAS meta-analysis, with and without sample overlap, and case-case GWAS. We compare performance of leading methods for summary-statistics-based genomic analysis and also introduce a novel framework that can unify usual summary-statistics-based implementations via the reconstruction of allelic and genotypic frequencies and counts (ReACt). First, we evaluate ASSET, METAL, and ReACt using both synthetic and real data for GWAS meta-analysis (with and without sample overlap) and find that, while all three methods are comparable in terms of power and error control, ReACt and METAL are faster than ASSET by a factor of at least hundred. We then proceed to evaluate performance of ReACt vs an existing method for case-case GWAS and show comparable performance, with ReACt requiring minimal underlying assumptions and being more user-friendly. Finally, ReACt allows us to evaluate, for the first time, an implementation for calculating polygenic risk score (PRS) for groups of cases and controls based on summary statistics. Our work demonstrates the power of GWAS summary-statistics-based methodologies and the proposed novel method provides a unifying framework and allows further extension of possibilities for researchers seeking to understand the genetics of complex disease. Nature Publishing Group UK 2022-05-17 /pmc/articles/PMC9114146/ /pubmed/35581276 http://dx.doi.org/10.1038/s41598-022-12185-6 Text en © The Author(s) 2022 https://creativecommons.org/licenses/by/4.0/Open AccessThis article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) .
spellingShingle Article
Yang, Zhiyu
Paschou, Peristera
Drineas, Petros
Reconstructing SNP allele and genotype frequencies from GWAS summary statistics
title Reconstructing SNP allele and genotype frequencies from GWAS summary statistics
title_full Reconstructing SNP allele and genotype frequencies from GWAS summary statistics
title_fullStr Reconstructing SNP allele and genotype frequencies from GWAS summary statistics
title_full_unstemmed Reconstructing SNP allele and genotype frequencies from GWAS summary statistics
title_short Reconstructing SNP allele and genotype frequencies from GWAS summary statistics
title_sort reconstructing snp allele and genotype frequencies from gwas summary statistics
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9114146/
https://www.ncbi.nlm.nih.gov/pubmed/35581276
http://dx.doi.org/10.1038/s41598-022-12185-6
work_keys_str_mv AT yangzhiyu reconstructingsnpalleleandgenotypefrequenciesfromgwassummarystatistics
AT paschouperistera reconstructingsnpalleleandgenotypefrequenciesfromgwassummarystatistics
AT drineaspetros reconstructingsnpalleleandgenotypefrequenciesfromgwassummarystatistics