Cargando…

ProGeM: a framework for the prioritization of candidate causal genes at molecular quantitative trait loci

Quantitative trait locus (QTL) mapping of molecular phenotypes such as metabolites, lipids and proteins through genome-wide association studies represents a powerful means of highlighting molecular mechanisms relevant to human diseases. However, a major challenge of this approach is to identify the...

Descripción completa

Detalles Bibliográficos
Autores principales: Stacey, David, Fauman, Eric B, Ziemek, Daniel, Sun, Benjamin B, Harshfield, Eric L, Wood, Angela M, Butterworth, Adam S, Suhre, Karsten, Paul, Dirk S
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Oxford University Press 2019
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6326795/
https://www.ncbi.nlm.nih.gov/pubmed/30239796
http://dx.doi.org/10.1093/nar/gky837
_version_ 1783386367924895744
author Stacey, David
Fauman, Eric B
Ziemek, Daniel
Sun, Benjamin B
Harshfield, Eric L
Wood, Angela M
Butterworth, Adam S
Suhre, Karsten
Paul, Dirk S
author_facet Stacey, David
Fauman, Eric B
Ziemek, Daniel
Sun, Benjamin B
Harshfield, Eric L
Wood, Angela M
Butterworth, Adam S
Suhre, Karsten
Paul, Dirk S
author_sort Stacey, David
collection PubMed
description Quantitative trait locus (QTL) mapping of molecular phenotypes such as metabolites, lipids and proteins through genome-wide association studies represents a powerful means of highlighting molecular mechanisms relevant to human diseases. However, a major challenge of this approach is to identify the causal gene(s) at the observed QTLs. Here, we present a framework for the ‘Prioritization of candidate causal Genes at Molecular QTLs’ (ProGeM), which incorporates biological domain-specific annotation data alongside genome annotation data from multiple repositories. We assessed the performance of ProGeM using a reference set of 227 previously reported and extensively curated metabolite QTLs. For 98% of these loci, the expert-curated gene was one of the candidate causal genes prioritized by ProGeM. Benchmarking analyses revealed that 69% of the causal candidates were nearest to the sentinel variant at the investigated molecular QTLs, indicating that genomic proximity is the most reliable indicator of ‘true positive’ causal genes. In contrast, cis-gene expression QTL data led to three false positive candidate causal gene assignments for every one true positive assignment. We provide evidence that these conclusions also apply to other molecular phenotypes, suggesting that ProGeM is a powerful and versatile tool for annotating molecular QTLs. ProGeM is freely available via GitHub.
format Online
Article
Text
id pubmed-6326795
institution National Center for Biotechnology Information
language English
publishDate 2019
publisher Oxford University Press
record_format MEDLINE/PubMed
spelling pubmed-63267952019-01-15 ProGeM: a framework for the prioritization of candidate causal genes at molecular quantitative trait loci Stacey, David Fauman, Eric B Ziemek, Daniel Sun, Benjamin B Harshfield, Eric L Wood, Angela M Butterworth, Adam S Suhre, Karsten Paul, Dirk S Nucleic Acids Res Methods Online Quantitative trait locus (QTL) mapping of molecular phenotypes such as metabolites, lipids and proteins through genome-wide association studies represents a powerful means of highlighting molecular mechanisms relevant to human diseases. However, a major challenge of this approach is to identify the causal gene(s) at the observed QTLs. Here, we present a framework for the ‘Prioritization of candidate causal Genes at Molecular QTLs’ (ProGeM), which incorporates biological domain-specific annotation data alongside genome annotation data from multiple repositories. We assessed the performance of ProGeM using a reference set of 227 previously reported and extensively curated metabolite QTLs. For 98% of these loci, the expert-curated gene was one of the candidate causal genes prioritized by ProGeM. Benchmarking analyses revealed that 69% of the causal candidates were nearest to the sentinel variant at the investigated molecular QTLs, indicating that genomic proximity is the most reliable indicator of ‘true positive’ causal genes. In contrast, cis-gene expression QTL data led to three false positive candidate causal gene assignments for every one true positive assignment. We provide evidence that these conclusions also apply to other molecular phenotypes, suggesting that ProGeM is a powerful and versatile tool for annotating molecular QTLs. ProGeM is freely available via GitHub. Oxford University Press 2019-01-10 2018-09-20 /pmc/articles/PMC6326795/ /pubmed/30239796 http://dx.doi.org/10.1093/nar/gky837 Text en © The Author(s) 2018. Published by Oxford University Press on behalf of Nucleic Acids Research. http://creativecommons.org/licenses/by/4.0/ This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted reuse, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Methods Online
Stacey, David
Fauman, Eric B
Ziemek, Daniel
Sun, Benjamin B
Harshfield, Eric L
Wood, Angela M
Butterworth, Adam S
Suhre, Karsten
Paul, Dirk S
ProGeM: a framework for the prioritization of candidate causal genes at molecular quantitative trait loci
title ProGeM: a framework for the prioritization of candidate causal genes at molecular quantitative trait loci
title_full ProGeM: a framework for the prioritization of candidate causal genes at molecular quantitative trait loci
title_fullStr ProGeM: a framework for the prioritization of candidate causal genes at molecular quantitative trait loci
title_full_unstemmed ProGeM: a framework for the prioritization of candidate causal genes at molecular quantitative trait loci
title_short ProGeM: a framework for the prioritization of candidate causal genes at molecular quantitative trait loci
title_sort progem: a framework for the prioritization of candidate causal genes at molecular quantitative trait loci
topic Methods Online
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6326795/
https://www.ncbi.nlm.nih.gov/pubmed/30239796
http://dx.doi.org/10.1093/nar/gky837
work_keys_str_mv AT staceydavid progemaframeworkfortheprioritizationofcandidatecausalgenesatmolecularquantitativetraitloci
AT faumanericb progemaframeworkfortheprioritizationofcandidatecausalgenesatmolecularquantitativetraitloci
AT ziemekdaniel progemaframeworkfortheprioritizationofcandidatecausalgenesatmolecularquantitativetraitloci
AT sunbenjaminb progemaframeworkfortheprioritizationofcandidatecausalgenesatmolecularquantitativetraitloci
AT harshfieldericl progemaframeworkfortheprioritizationofcandidatecausalgenesatmolecularquantitativetraitloci
AT woodangelam progemaframeworkfortheprioritizationofcandidatecausalgenesatmolecularquantitativetraitloci
AT butterworthadams progemaframeworkfortheprioritizationofcandidatecausalgenesatmolecularquantitativetraitloci
AT suhrekarsten progemaframeworkfortheprioritizationofcandidatecausalgenesatmolecularquantitativetraitloci
AT pauldirks progemaframeworkfortheprioritizationofcandidatecausalgenesatmolecularquantitativetraitloci