Cargando…
Reconstructing SNP allele and genotype frequencies from GWAS summary statistics
The emergence of genome-wide association studies (GWAS) has led to the creation of large repositories of human genetic variation, creating enormous opportunities for genetic research and worldwide collaboration. Methods that are based on GWAS summary statistics seek to leverage such records, overcom...
Autores principales: | , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Nature Publishing Group UK
2022
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9114146/ https://www.ncbi.nlm.nih.gov/pubmed/35581276 http://dx.doi.org/10.1038/s41598-022-12185-6 |
_version_ | 1784709720168202240 |
---|---|
author | Yang, Zhiyu Paschou, Peristera Drineas, Petros |
author_facet | Yang, Zhiyu Paschou, Peristera Drineas, Petros |
author_sort | Yang, Zhiyu |
collection | PubMed |
description | The emergence of genome-wide association studies (GWAS) has led to the creation of large repositories of human genetic variation, creating enormous opportunities for genetic research and worldwide collaboration. Methods that are based on GWAS summary statistics seek to leverage such records, overcoming barriers that often exist in individual-level data access while also offering significant computational savings. Such summary-statistics-based applications include GWAS meta-analysis, with and without sample overlap, and case-case GWAS. We compare performance of leading methods for summary-statistics-based genomic analysis and also introduce a novel framework that can unify usual summary-statistics-based implementations via the reconstruction of allelic and genotypic frequencies and counts (ReACt). First, we evaluate ASSET, METAL, and ReACt using both synthetic and real data for GWAS meta-analysis (with and without sample overlap) and find that, while all three methods are comparable in terms of power and error control, ReACt and METAL are faster than ASSET by a factor of at least hundred. We then proceed to evaluate performance of ReACt vs an existing method for case-case GWAS and show comparable performance, with ReACt requiring minimal underlying assumptions and being more user-friendly. Finally, ReACt allows us to evaluate, for the first time, an implementation for calculating polygenic risk score (PRS) for groups of cases and controls based on summary statistics. Our work demonstrates the power of GWAS summary-statistics-based methodologies and the proposed novel method provides a unifying framework and allows further extension of possibilities for researchers seeking to understand the genetics of complex disease. |
format | Online Article Text |
id | pubmed-9114146 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2022 |
publisher | Nature Publishing Group UK |
record_format | MEDLINE/PubMed |
spelling | pubmed-91141462022-05-19 Reconstructing SNP allele and genotype frequencies from GWAS summary statistics Yang, Zhiyu Paschou, Peristera Drineas, Petros Sci Rep Article The emergence of genome-wide association studies (GWAS) has led to the creation of large repositories of human genetic variation, creating enormous opportunities for genetic research and worldwide collaboration. Methods that are based on GWAS summary statistics seek to leverage such records, overcoming barriers that often exist in individual-level data access while also offering significant computational savings. Such summary-statistics-based applications include GWAS meta-analysis, with and without sample overlap, and case-case GWAS. We compare performance of leading methods for summary-statistics-based genomic analysis and also introduce a novel framework that can unify usual summary-statistics-based implementations via the reconstruction of allelic and genotypic frequencies and counts (ReACt). First, we evaluate ASSET, METAL, and ReACt using both synthetic and real data for GWAS meta-analysis (with and without sample overlap) and find that, while all three methods are comparable in terms of power and error control, ReACt and METAL are faster than ASSET by a factor of at least hundred. We then proceed to evaluate performance of ReACt vs an existing method for case-case GWAS and show comparable performance, with ReACt requiring minimal underlying assumptions and being more user-friendly. Finally, ReACt allows us to evaluate, for the first time, an implementation for calculating polygenic risk score (PRS) for groups of cases and controls based on summary statistics. Our work demonstrates the power of GWAS summary-statistics-based methodologies and the proposed novel method provides a unifying framework and allows further extension of possibilities for researchers seeking to understand the genetics of complex disease. Nature Publishing Group UK 2022-05-17 /pmc/articles/PMC9114146/ /pubmed/35581276 http://dx.doi.org/10.1038/s41598-022-12185-6 Text en © The Author(s) 2022 https://creativecommons.org/licenses/by/4.0/Open AccessThis article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) . |
spellingShingle | Article Yang, Zhiyu Paschou, Peristera Drineas, Petros Reconstructing SNP allele and genotype frequencies from GWAS summary statistics |
title | Reconstructing SNP allele and genotype frequencies from GWAS summary statistics |
title_full | Reconstructing SNP allele and genotype frequencies from GWAS summary statistics |
title_fullStr | Reconstructing SNP allele and genotype frequencies from GWAS summary statistics |
title_full_unstemmed | Reconstructing SNP allele and genotype frequencies from GWAS summary statistics |
title_short | Reconstructing SNP allele and genotype frequencies from GWAS summary statistics |
title_sort | reconstructing snp allele and genotype frequencies from gwas summary statistics |
topic | Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9114146/ https://www.ncbi.nlm.nih.gov/pubmed/35581276 http://dx.doi.org/10.1038/s41598-022-12185-6 |
work_keys_str_mv | AT yangzhiyu reconstructingsnpalleleandgenotypefrequenciesfromgwassummarystatistics AT paschouperistera reconstructingsnpalleleandgenotypefrequenciesfromgwassummarystatistics AT drineaspetros reconstructingsnpalleleandgenotypefrequenciesfromgwassummarystatistics |