Cargando…

Leveraging supervised learning for functionally informed fine-mapping of cis-eQTLs identifies an additional 20,913 putative causal eQTLs

The large majority of variants identified by GWAS are non-coding, motivating detailed characterization of the function of non-coding variants. Experimental methods to assess variants’ effect on gene expressions in native chromatin context via direct perturbation are low-throughput. Existing high-thr...

Descripción completa

Detalles Bibliográficos
Autores principales: Wang, Qingbo S., Kelley, David R., Ulirsch, Jacob, Kanai, Masahiro, Sadhuka, Shuvom, Cui, Ran, Albors, Carlos, Cheng, Nathan, Okada, Yukinori, Aguet, Francois, Ardlie, Kristin G., MacArthur, Daniel G., Finucane, Hilary K.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Nature Publishing Group UK 2021
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8184741/
https://www.ncbi.nlm.nih.gov/pubmed/34099641
http://dx.doi.org/10.1038/s41467-021-23134-8
_version_ 1783704640610631680
author Wang, Qingbo S.
Kelley, David R.
Ulirsch, Jacob
Kanai, Masahiro
Sadhuka, Shuvom
Cui, Ran
Albors, Carlos
Cheng, Nathan
Okada, Yukinori
Aguet, Francois
Ardlie, Kristin G.
MacArthur, Daniel G.
Finucane, Hilary K.
author_facet Wang, Qingbo S.
Kelley, David R.
Ulirsch, Jacob
Kanai, Masahiro
Sadhuka, Shuvom
Cui, Ran
Albors, Carlos
Cheng, Nathan
Okada, Yukinori
Aguet, Francois
Ardlie, Kristin G.
MacArthur, Daniel G.
Finucane, Hilary K.
author_sort Wang, Qingbo S.
collection PubMed
description The large majority of variants identified by GWAS are non-coding, motivating detailed characterization of the function of non-coding variants. Experimental methods to assess variants’ effect on gene expressions in native chromatin context via direct perturbation are low-throughput. Existing high-throughput computational predictors thus have lacked large gold standard sets of regulatory variants for training and validation. Here, we leverage a set of 14,807 putative causal eQTLs in humans obtained through statistical fine-mapping, and we use 6121 features to directly train a predictor of whether a variant modifies nearby gene expression. We call the resulting prediction the expression modifier score (EMS). We validate EMS by comparing its ability to prioritize functional variants with other major scores. We then use EMS as a prior for statistical fine-mapping of eQTLs to identify an additional 20,913 putatively causal eQTLs, and we incorporate EMS into co-localization analysis to identify 310 additional candidate genes across UK Biobank phenotypes.
format Online
Article
Text
id pubmed-8184741
institution National Center for Biotechnology Information
language English
publishDate 2021
publisher Nature Publishing Group UK
record_format MEDLINE/PubMed
spelling pubmed-81847412021-06-09 Leveraging supervised learning for functionally informed fine-mapping of cis-eQTLs identifies an additional 20,913 putative causal eQTLs Wang, Qingbo S. Kelley, David R. Ulirsch, Jacob Kanai, Masahiro Sadhuka, Shuvom Cui, Ran Albors, Carlos Cheng, Nathan Okada, Yukinori Aguet, Francois Ardlie, Kristin G. MacArthur, Daniel G. Finucane, Hilary K. Nat Commun Article The large majority of variants identified by GWAS are non-coding, motivating detailed characterization of the function of non-coding variants. Experimental methods to assess variants’ effect on gene expressions in native chromatin context via direct perturbation are low-throughput. Existing high-throughput computational predictors thus have lacked large gold standard sets of regulatory variants for training and validation. Here, we leverage a set of 14,807 putative causal eQTLs in humans obtained through statistical fine-mapping, and we use 6121 features to directly train a predictor of whether a variant modifies nearby gene expression. We call the resulting prediction the expression modifier score (EMS). We validate EMS by comparing its ability to prioritize functional variants with other major scores. We then use EMS as a prior for statistical fine-mapping of eQTLs to identify an additional 20,913 putatively causal eQTLs, and we incorporate EMS into co-localization analysis to identify 310 additional candidate genes across UK Biobank phenotypes. Nature Publishing Group UK 2021-06-07 /pmc/articles/PMC8184741/ /pubmed/34099641 http://dx.doi.org/10.1038/s41467-021-23134-8 Text en © The Author(s) 2021 https://creativecommons.org/licenses/by/4.0/Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) .
spellingShingle Article
Wang, Qingbo S.
Kelley, David R.
Ulirsch, Jacob
Kanai, Masahiro
Sadhuka, Shuvom
Cui, Ran
Albors, Carlos
Cheng, Nathan
Okada, Yukinori
Aguet, Francois
Ardlie, Kristin G.
MacArthur, Daniel G.
Finucane, Hilary K.
Leveraging supervised learning for functionally informed fine-mapping of cis-eQTLs identifies an additional 20,913 putative causal eQTLs
title Leveraging supervised learning for functionally informed fine-mapping of cis-eQTLs identifies an additional 20,913 putative causal eQTLs
title_full Leveraging supervised learning for functionally informed fine-mapping of cis-eQTLs identifies an additional 20,913 putative causal eQTLs
title_fullStr Leveraging supervised learning for functionally informed fine-mapping of cis-eQTLs identifies an additional 20,913 putative causal eQTLs
title_full_unstemmed Leveraging supervised learning for functionally informed fine-mapping of cis-eQTLs identifies an additional 20,913 putative causal eQTLs
title_short Leveraging supervised learning for functionally informed fine-mapping of cis-eQTLs identifies an additional 20,913 putative causal eQTLs
title_sort leveraging supervised learning for functionally informed fine-mapping of cis-eqtls identifies an additional 20,913 putative causal eqtls
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8184741/
https://www.ncbi.nlm.nih.gov/pubmed/34099641
http://dx.doi.org/10.1038/s41467-021-23134-8
work_keys_str_mv AT wangqingbos leveragingsupervisedlearningforfunctionallyinformedfinemappingofciseqtlsidentifiesanadditional20913putativecausaleqtls
AT kelleydavidr leveragingsupervisedlearningforfunctionallyinformedfinemappingofciseqtlsidentifiesanadditional20913putativecausaleqtls
AT ulirschjacob leveragingsupervisedlearningforfunctionallyinformedfinemappingofciseqtlsidentifiesanadditional20913putativecausaleqtls
AT kanaimasahiro leveragingsupervisedlearningforfunctionallyinformedfinemappingofciseqtlsidentifiesanadditional20913putativecausaleqtls
AT sadhukashuvom leveragingsupervisedlearningforfunctionallyinformedfinemappingofciseqtlsidentifiesanadditional20913putativecausaleqtls
AT cuiran leveragingsupervisedlearningforfunctionallyinformedfinemappingofciseqtlsidentifiesanadditional20913putativecausaleqtls
AT alborscarlos leveragingsupervisedlearningforfunctionallyinformedfinemappingofciseqtlsidentifiesanadditional20913putativecausaleqtls
AT chengnathan leveragingsupervisedlearningforfunctionallyinformedfinemappingofciseqtlsidentifiesanadditional20913putativecausaleqtls
AT okadayukinori leveragingsupervisedlearningforfunctionallyinformedfinemappingofciseqtlsidentifiesanadditional20913putativecausaleqtls
AT leveragingsupervisedlearningforfunctionallyinformedfinemappingofciseqtlsidentifiesanadditional20913putativecausaleqtls
AT aguetfrancois leveragingsupervisedlearningforfunctionallyinformedfinemappingofciseqtlsidentifiesanadditional20913putativecausaleqtls
AT ardliekristing leveragingsupervisedlearningforfunctionallyinformedfinemappingofciseqtlsidentifiesanadditional20913putativecausaleqtls
AT macarthurdanielg leveragingsupervisedlearningforfunctionallyinformedfinemappingofciseqtlsidentifiesanadditional20913putativecausaleqtls
AT finucanehilaryk leveragingsupervisedlearningforfunctionallyinformedfinemappingofciseqtlsidentifiesanadditional20913putativecausaleqtls