Cargando…

GWAS quality score for evaluating associated regions in GWAS analyses

MOTIVATION: The number of significantly associated regions reported in genome-wide association studies (GWAS) for polygenic traits typically increases with sample size. A traditional tool for quality control and identification of significant regions has been a visual inspection of how significant an...

Descripción completa

Detalles Bibliográficos
Autores principales: Awasthi, Swapnil, Chen, Chia-Yen, Lam, Max, Huang, Hailiang, Ripke, Stephan, Altar, C Anthony
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Oxford University Press 2023
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9891241/
https://www.ncbi.nlm.nih.gov/pubmed/36651666
http://dx.doi.org/10.1093/bioinformatics/btad004
_version_ 1784881101359022080
author Awasthi, Swapnil
Chen, Chia-Yen
Lam, Max
Huang, Hailiang
Ripke, Stephan
Altar, C Anthony
author_facet Awasthi, Swapnil
Chen, Chia-Yen
Lam, Max
Huang, Hailiang
Ripke, Stephan
Altar, C Anthony
author_sort Awasthi, Swapnil
collection PubMed
description MOTIVATION: The number of significantly associated regions reported in genome-wide association studies (GWAS) for polygenic traits typically increases with sample size. A traditional tool for quality control and identification of significant regions has been a visual inspection of how significant and correlated genetic variants cluster within a region. However, while inspecting hundreds of regions, this subjective method can misattribute significance to some loci or neglect others that are significant. RESULTS: The GWAS quality score (GQS) identifies suspicious regions and prevents erroneous interpretations with an objective, quantitative and automated method. The GQS assesses all measured single nucleotide polymorphisms (SNPs) that are linked by inheritance to each other [linkage disequilibrium (LD)] and compares the significance of trait association of each SNP to its LD value for the reported index SNP. A GQS value of 1.0 ascribes a high level of confidence to the entire region and its underlying gene(s), while GQS values <1.0 indicate the need to closely inspect the outliers. We applied the GQS to published and non-published genome-wide summary statistics and report suspicious regions requiring secondary inspection while supporting the majority of reported regions from large-scale published meta-analyses. AVAILABILITY AND IMPLEMENTATION: The GQS code/scripts can be cloned from GitHub (https://github.com/Xswapnil/GQS/). The analyst can use whole-genome summary statistics to estimate GQS for each defined region. We also provide an online tool (http://35.227.18.38/) that gives access to the GQS. The quantitative measure of quality attributes by GQS and its visualization is an objective method that enhances the confidence of each genomic hit. SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.
format Online
Article
Text
id pubmed-9891241
institution National Center for Biotechnology Information
language English
publishDate 2023
publisher Oxford University Press
record_format MEDLINE/PubMed
spelling pubmed-98912412023-02-02 GWAS quality score for evaluating associated regions in GWAS analyses Awasthi, Swapnil Chen, Chia-Yen Lam, Max Huang, Hailiang Ripke, Stephan Altar, C Anthony Bioinformatics Original Paper MOTIVATION: The number of significantly associated regions reported in genome-wide association studies (GWAS) for polygenic traits typically increases with sample size. A traditional tool for quality control and identification of significant regions has been a visual inspection of how significant and correlated genetic variants cluster within a region. However, while inspecting hundreds of regions, this subjective method can misattribute significance to some loci or neglect others that are significant. RESULTS: The GWAS quality score (GQS) identifies suspicious regions and prevents erroneous interpretations with an objective, quantitative and automated method. The GQS assesses all measured single nucleotide polymorphisms (SNPs) that are linked by inheritance to each other [linkage disequilibrium (LD)] and compares the significance of trait association of each SNP to its LD value for the reported index SNP. A GQS value of 1.0 ascribes a high level of confidence to the entire region and its underlying gene(s), while GQS values <1.0 indicate the need to closely inspect the outliers. We applied the GQS to published and non-published genome-wide summary statistics and report suspicious regions requiring secondary inspection while supporting the majority of reported regions from large-scale published meta-analyses. AVAILABILITY AND IMPLEMENTATION: The GQS code/scripts can be cloned from GitHub (https://github.com/Xswapnil/GQS/). The analyst can use whole-genome summary statistics to estimate GQS for each defined region. We also provide an online tool (http://35.227.18.38/) that gives access to the GQS. The quantitative measure of quality attributes by GQS and its visualization is an objective method that enhances the confidence of each genomic hit. SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online. Oxford University Press 2023-01-18 /pmc/articles/PMC9891241/ /pubmed/36651666 http://dx.doi.org/10.1093/bioinformatics/btad004 Text en © The Author(s) 2023. Published by Oxford University Press. https://creativecommons.org/licenses/by/4.0/This is an Open Access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/), which permits unrestricted reuse, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Original Paper
Awasthi, Swapnil
Chen, Chia-Yen
Lam, Max
Huang, Hailiang
Ripke, Stephan
Altar, C Anthony
GWAS quality score for evaluating associated regions in GWAS analyses
title GWAS quality score for evaluating associated regions in GWAS analyses
title_full GWAS quality score for evaluating associated regions in GWAS analyses
title_fullStr GWAS quality score for evaluating associated regions in GWAS analyses
title_full_unstemmed GWAS quality score for evaluating associated regions in GWAS analyses
title_short GWAS quality score for evaluating associated regions in GWAS analyses
title_sort gwas quality score for evaluating associated regions in gwas analyses
topic Original Paper
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9891241/
https://www.ncbi.nlm.nih.gov/pubmed/36651666
http://dx.doi.org/10.1093/bioinformatics/btad004
work_keys_str_mv AT awasthiswapnil gwasqualityscoreforevaluatingassociatedregionsingwasanalyses
AT chenchiayen gwasqualityscoreforevaluatingassociatedregionsingwasanalyses
AT lammax gwasqualityscoreforevaluatingassociatedregionsingwasanalyses
AT huanghailiang gwasqualityscoreforevaluatingassociatedregionsingwasanalyses
AT ripkestephan gwasqualityscoreforevaluatingassociatedregionsingwasanalyses
AT altarcanthony gwasqualityscoreforevaluatingassociatedregionsingwasanalyses