Cargando…

Estimation of non-null SNP effect size distributions enables the detection of enriched genes underlying complex traits

Traditional univariate genome-wide association studies generate false positives and negatives due to difficulties distinguishing associated variants from variants with spurious nonzero effects that do not directly influence the trait. Recent efforts have been directed at identifying genes or signali...

Descripción completa

Detalles Bibliográficos
Autores principales: Cheng, Wei, Ramachandran, Sohini, Crawford, Lorin
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Public Library of Science 2020
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7316356/
https://www.ncbi.nlm.nih.gov/pubmed/32542026
http://dx.doi.org/10.1371/journal.pgen.1008855
_version_ 1783550423107371008
author Cheng, Wei
Ramachandran, Sohini
Crawford, Lorin
author_facet Cheng, Wei
Ramachandran, Sohini
Crawford, Lorin
author_sort Cheng, Wei
collection PubMed
description Traditional univariate genome-wide association studies generate false positives and negatives due to difficulties distinguishing associated variants from variants with spurious nonzero effects that do not directly influence the trait. Recent efforts have been directed at identifying genes or signaling pathways enriched for mutations in quantitative traits or case-control studies, but these can be computationally costly and hampered by strict model assumptions. Here, we present gene-ε, a new approach for identifying statistical associations between sets of variants and quantitative traits. Our key insight is that enrichment studies on the gene-level are improved when we reformulate the genome-wide SNP-level null hypothesis to identify spurious small-to-intermediate SNP effects and classify them as non-causal. gene-ε efficiently identifies enriched genes under a variety of simulated genetic architectures, achieving greater than a 90% true positive rate at 1% false positive rate for polygenic traits. Lastly, we apply gene-ε to summary statistics derived from six quantitative traits using European-ancestry individuals in the UK Biobank, and identify enriched genes that are in biologically relevant pathways.
format Online
Article
Text
id pubmed-7316356
institution National Center for Biotechnology Information
language English
publishDate 2020
publisher Public Library of Science
record_format MEDLINE/PubMed
spelling pubmed-73163562020-06-30 Estimation of non-null SNP effect size distributions enables the detection of enriched genes underlying complex traits Cheng, Wei Ramachandran, Sohini Crawford, Lorin PLoS Genet Research Article Traditional univariate genome-wide association studies generate false positives and negatives due to difficulties distinguishing associated variants from variants with spurious nonzero effects that do not directly influence the trait. Recent efforts have been directed at identifying genes or signaling pathways enriched for mutations in quantitative traits or case-control studies, but these can be computationally costly and hampered by strict model assumptions. Here, we present gene-ε, a new approach for identifying statistical associations between sets of variants and quantitative traits. Our key insight is that enrichment studies on the gene-level are improved when we reformulate the genome-wide SNP-level null hypothesis to identify spurious small-to-intermediate SNP effects and classify them as non-causal. gene-ε efficiently identifies enriched genes under a variety of simulated genetic architectures, achieving greater than a 90% true positive rate at 1% false positive rate for polygenic traits. Lastly, we apply gene-ε to summary statistics derived from six quantitative traits using European-ancestry individuals in the UK Biobank, and identify enriched genes that are in biologically relevant pathways. Public Library of Science 2020-06-15 /pmc/articles/PMC7316356/ /pubmed/32542026 http://dx.doi.org/10.1371/journal.pgen.1008855 Text en © 2020 Cheng et al http://creativecommons.org/licenses/by/4.0/ This is an open access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0/) , which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
spellingShingle Research Article
Cheng, Wei
Ramachandran, Sohini
Crawford, Lorin
Estimation of non-null SNP effect size distributions enables the detection of enriched genes underlying complex traits
title Estimation of non-null SNP effect size distributions enables the detection of enriched genes underlying complex traits
title_full Estimation of non-null SNP effect size distributions enables the detection of enriched genes underlying complex traits
title_fullStr Estimation of non-null SNP effect size distributions enables the detection of enriched genes underlying complex traits
title_full_unstemmed Estimation of non-null SNP effect size distributions enables the detection of enriched genes underlying complex traits
title_short Estimation of non-null SNP effect size distributions enables the detection of enriched genes underlying complex traits
title_sort estimation of non-null snp effect size distributions enables the detection of enriched genes underlying complex traits
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7316356/
https://www.ncbi.nlm.nih.gov/pubmed/32542026
http://dx.doi.org/10.1371/journal.pgen.1008855
work_keys_str_mv AT chengwei estimationofnonnullsnpeffectsizedistributionsenablesthedetectionofenrichedgenesunderlyingcomplextraits
AT ramachandransohini estimationofnonnullsnpeffectsizedistributionsenablesthedetectionofenrichedgenesunderlyingcomplextraits
AT crawfordlorin estimationofnonnullsnpeffectsizedistributionsenablesthedetectionofenrichedgenesunderlyingcomplextraits