Cargando…
Improved analyses of GWAS summary statistics by reducing data heterogeneity and errors
Summary statistics from genome-wide association studies (GWAS) have facilitated the development of various summary data-based methods, which typically require a reference sample for linkage disequilibrium (LD) estimation. Analyses using these methods may be biased by errors in GWAS summary data or L...
Autores principales: | , , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Nature Publishing Group UK
2021
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8654883/ https://www.ncbi.nlm.nih.gov/pubmed/34880243 http://dx.doi.org/10.1038/s41467-021-27438-7 |
_version_ | 1784611961711886336 |
---|---|
author | Chen, Wenhan Wu, Yang Zheng, Zhili Qi, Ting Visscher, Peter M. Zhu, Zhihong Yang, Jian |
author_facet | Chen, Wenhan Wu, Yang Zheng, Zhili Qi, Ting Visscher, Peter M. Zhu, Zhihong Yang, Jian |
author_sort | Chen, Wenhan |
collection | PubMed |
description | Summary statistics from genome-wide association studies (GWAS) have facilitated the development of various summary data-based methods, which typically require a reference sample for linkage disequilibrium (LD) estimation. Analyses using these methods may be biased by errors in GWAS summary data or LD reference or heterogeneity between GWAS and LD reference. Here we propose a quality control method, DENTIST, that leverages LD among genetic variants to detect and eliminate errors in GWAS or LD reference and heterogeneity between the two. Through simulations, we demonstrate that DENTIST substantially reduces false-positive rate in detecting secondary signals in the summary-data-based conditional and joint association analysis, especially for imputed rare variants (false-positive rate reduced from >28% to <2% in the presence of heterogeneity between GWAS and LD reference). We further show that DENTIST can improve other summary-data-based analyses such as fine-mapping analysis. |
format | Online Article Text |
id | pubmed-8654883 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2021 |
publisher | Nature Publishing Group UK |
record_format | MEDLINE/PubMed |
spelling | pubmed-86548832021-12-27 Improved analyses of GWAS summary statistics by reducing data heterogeneity and errors Chen, Wenhan Wu, Yang Zheng, Zhili Qi, Ting Visscher, Peter M. Zhu, Zhihong Yang, Jian Nat Commun Article Summary statistics from genome-wide association studies (GWAS) have facilitated the development of various summary data-based methods, which typically require a reference sample for linkage disequilibrium (LD) estimation. Analyses using these methods may be biased by errors in GWAS summary data or LD reference or heterogeneity between GWAS and LD reference. Here we propose a quality control method, DENTIST, that leverages LD among genetic variants to detect and eliminate errors in GWAS or LD reference and heterogeneity between the two. Through simulations, we demonstrate that DENTIST substantially reduces false-positive rate in detecting secondary signals in the summary-data-based conditional and joint association analysis, especially for imputed rare variants (false-positive rate reduced from >28% to <2% in the presence of heterogeneity between GWAS and LD reference). We further show that DENTIST can improve other summary-data-based analyses such as fine-mapping analysis. Nature Publishing Group UK 2021-12-08 /pmc/articles/PMC8654883/ /pubmed/34880243 http://dx.doi.org/10.1038/s41467-021-27438-7 Text en © The Author(s) 2021 https://creativecommons.org/licenses/by/4.0/Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) . |
spellingShingle | Article Chen, Wenhan Wu, Yang Zheng, Zhili Qi, Ting Visscher, Peter M. Zhu, Zhihong Yang, Jian Improved analyses of GWAS summary statistics by reducing data heterogeneity and errors |
title | Improved analyses of GWAS summary statistics by reducing data heterogeneity and errors |
title_full | Improved analyses of GWAS summary statistics by reducing data heterogeneity and errors |
title_fullStr | Improved analyses of GWAS summary statistics by reducing data heterogeneity and errors |
title_full_unstemmed | Improved analyses of GWAS summary statistics by reducing data heterogeneity and errors |
title_short | Improved analyses of GWAS summary statistics by reducing data heterogeneity and errors |
title_sort | improved analyses of gwas summary statistics by reducing data heterogeneity and errors |
topic | Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8654883/ https://www.ncbi.nlm.nih.gov/pubmed/34880243 http://dx.doi.org/10.1038/s41467-021-27438-7 |
work_keys_str_mv | AT chenwenhan improvedanalysesofgwassummarystatisticsbyreducingdataheterogeneityanderrors AT wuyang improvedanalysesofgwassummarystatisticsbyreducingdataheterogeneityanderrors AT zhengzhili improvedanalysesofgwassummarystatisticsbyreducingdataheterogeneityanderrors AT qiting improvedanalysesofgwassummarystatisticsbyreducingdataheterogeneityanderrors AT visscherpeterm improvedanalysesofgwassummarystatisticsbyreducingdataheterogeneityanderrors AT zhuzhihong improvedanalysesofgwassummarystatisticsbyreducingdataheterogeneityanderrors AT yangjian improvedanalysesofgwassummarystatisticsbyreducingdataheterogeneityanderrors |