Cargando…
Estimation of non-null SNP effect size distributions enables the detection of enriched genes underlying complex traits
Traditional univariate genome-wide association studies generate false positives and negatives due to difficulties distinguishing associated variants from variants with spurious nonzero effects that do not directly influence the trait. Recent efforts have been directed at identifying genes or signali...
Autores principales: | , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Public Library of Science
2020
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7316356/ https://www.ncbi.nlm.nih.gov/pubmed/32542026 http://dx.doi.org/10.1371/journal.pgen.1008855 |
_version_ | 1783550423107371008 |
---|---|
author | Cheng, Wei Ramachandran, Sohini Crawford, Lorin |
author_facet | Cheng, Wei Ramachandran, Sohini Crawford, Lorin |
author_sort | Cheng, Wei |
collection | PubMed |
description | Traditional univariate genome-wide association studies generate false positives and negatives due to difficulties distinguishing associated variants from variants with spurious nonzero effects that do not directly influence the trait. Recent efforts have been directed at identifying genes or signaling pathways enriched for mutations in quantitative traits or case-control studies, but these can be computationally costly and hampered by strict model assumptions. Here, we present gene-ε, a new approach for identifying statistical associations between sets of variants and quantitative traits. Our key insight is that enrichment studies on the gene-level are improved when we reformulate the genome-wide SNP-level null hypothesis to identify spurious small-to-intermediate SNP effects and classify them as non-causal. gene-ε efficiently identifies enriched genes under a variety of simulated genetic architectures, achieving greater than a 90% true positive rate at 1% false positive rate for polygenic traits. Lastly, we apply gene-ε to summary statistics derived from six quantitative traits using European-ancestry individuals in the UK Biobank, and identify enriched genes that are in biologically relevant pathways. |
format | Online Article Text |
id | pubmed-7316356 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2020 |
publisher | Public Library of Science |
record_format | MEDLINE/PubMed |
spelling | pubmed-73163562020-06-30 Estimation of non-null SNP effect size distributions enables the detection of enriched genes underlying complex traits Cheng, Wei Ramachandran, Sohini Crawford, Lorin PLoS Genet Research Article Traditional univariate genome-wide association studies generate false positives and negatives due to difficulties distinguishing associated variants from variants with spurious nonzero effects that do not directly influence the trait. Recent efforts have been directed at identifying genes or signaling pathways enriched for mutations in quantitative traits or case-control studies, but these can be computationally costly and hampered by strict model assumptions. Here, we present gene-ε, a new approach for identifying statistical associations between sets of variants and quantitative traits. Our key insight is that enrichment studies on the gene-level are improved when we reformulate the genome-wide SNP-level null hypothesis to identify spurious small-to-intermediate SNP effects and classify them as non-causal. gene-ε efficiently identifies enriched genes under a variety of simulated genetic architectures, achieving greater than a 90% true positive rate at 1% false positive rate for polygenic traits. Lastly, we apply gene-ε to summary statistics derived from six quantitative traits using European-ancestry individuals in the UK Biobank, and identify enriched genes that are in biologically relevant pathways. Public Library of Science 2020-06-15 /pmc/articles/PMC7316356/ /pubmed/32542026 http://dx.doi.org/10.1371/journal.pgen.1008855 Text en © 2020 Cheng et al http://creativecommons.org/licenses/by/4.0/ This is an open access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0/) , which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited. |
spellingShingle | Research Article Cheng, Wei Ramachandran, Sohini Crawford, Lorin Estimation of non-null SNP effect size distributions enables the detection of enriched genes underlying complex traits |
title | Estimation of non-null SNP effect size distributions enables the detection of enriched genes underlying complex traits |
title_full | Estimation of non-null SNP effect size distributions enables the detection of enriched genes underlying complex traits |
title_fullStr | Estimation of non-null SNP effect size distributions enables the detection of enriched genes underlying complex traits |
title_full_unstemmed | Estimation of non-null SNP effect size distributions enables the detection of enriched genes underlying complex traits |
title_short | Estimation of non-null SNP effect size distributions enables the detection of enriched genes underlying complex traits |
title_sort | estimation of non-null snp effect size distributions enables the detection of enriched genes underlying complex traits |
topic | Research Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7316356/ https://www.ncbi.nlm.nih.gov/pubmed/32542026 http://dx.doi.org/10.1371/journal.pgen.1008855 |
work_keys_str_mv | AT chengwei estimationofnonnullsnpeffectsizedistributionsenablesthedetectionofenrichedgenesunderlyingcomplextraits AT ramachandransohini estimationofnonnullsnpeffectsizedistributionsenablesthedetectionofenrichedgenesunderlyingcomplextraits AT crawfordlorin estimationofnonnullsnpeffectsizedistributionsenablesthedetectionofenrichedgenesunderlyingcomplextraits |