Cargando…
ProGeM: a framework for the prioritization of candidate causal genes at molecular quantitative trait loci
Quantitative trait locus (QTL) mapping of molecular phenotypes such as metabolites, lipids and proteins through genome-wide association studies represents a powerful means of highlighting molecular mechanisms relevant to human diseases. However, a major challenge of this approach is to identify the...
Autores principales: | , , , , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Oxford University Press
2019
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6326795/ https://www.ncbi.nlm.nih.gov/pubmed/30239796 http://dx.doi.org/10.1093/nar/gky837 |
_version_ | 1783386367924895744 |
---|---|
author | Stacey, David Fauman, Eric B Ziemek, Daniel Sun, Benjamin B Harshfield, Eric L Wood, Angela M Butterworth, Adam S Suhre, Karsten Paul, Dirk S |
author_facet | Stacey, David Fauman, Eric B Ziemek, Daniel Sun, Benjamin B Harshfield, Eric L Wood, Angela M Butterworth, Adam S Suhre, Karsten Paul, Dirk S |
author_sort | Stacey, David |
collection | PubMed |
description | Quantitative trait locus (QTL) mapping of molecular phenotypes such as metabolites, lipids and proteins through genome-wide association studies represents a powerful means of highlighting molecular mechanisms relevant to human diseases. However, a major challenge of this approach is to identify the causal gene(s) at the observed QTLs. Here, we present a framework for the ‘Prioritization of candidate causal Genes at Molecular QTLs’ (ProGeM), which incorporates biological domain-specific annotation data alongside genome annotation data from multiple repositories. We assessed the performance of ProGeM using a reference set of 227 previously reported and extensively curated metabolite QTLs. For 98% of these loci, the expert-curated gene was one of the candidate causal genes prioritized by ProGeM. Benchmarking analyses revealed that 69% of the causal candidates were nearest to the sentinel variant at the investigated molecular QTLs, indicating that genomic proximity is the most reliable indicator of ‘true positive’ causal genes. In contrast, cis-gene expression QTL data led to three false positive candidate causal gene assignments for every one true positive assignment. We provide evidence that these conclusions also apply to other molecular phenotypes, suggesting that ProGeM is a powerful and versatile tool for annotating molecular QTLs. ProGeM is freely available via GitHub. |
format | Online Article Text |
id | pubmed-6326795 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2019 |
publisher | Oxford University Press |
record_format | MEDLINE/PubMed |
spelling | pubmed-63267952019-01-15 ProGeM: a framework for the prioritization of candidate causal genes at molecular quantitative trait loci Stacey, David Fauman, Eric B Ziemek, Daniel Sun, Benjamin B Harshfield, Eric L Wood, Angela M Butterworth, Adam S Suhre, Karsten Paul, Dirk S Nucleic Acids Res Methods Online Quantitative trait locus (QTL) mapping of molecular phenotypes such as metabolites, lipids and proteins through genome-wide association studies represents a powerful means of highlighting molecular mechanisms relevant to human diseases. However, a major challenge of this approach is to identify the causal gene(s) at the observed QTLs. Here, we present a framework for the ‘Prioritization of candidate causal Genes at Molecular QTLs’ (ProGeM), which incorporates biological domain-specific annotation data alongside genome annotation data from multiple repositories. We assessed the performance of ProGeM using a reference set of 227 previously reported and extensively curated metabolite QTLs. For 98% of these loci, the expert-curated gene was one of the candidate causal genes prioritized by ProGeM. Benchmarking analyses revealed that 69% of the causal candidates were nearest to the sentinel variant at the investigated molecular QTLs, indicating that genomic proximity is the most reliable indicator of ‘true positive’ causal genes. In contrast, cis-gene expression QTL data led to three false positive candidate causal gene assignments for every one true positive assignment. We provide evidence that these conclusions also apply to other molecular phenotypes, suggesting that ProGeM is a powerful and versatile tool for annotating molecular QTLs. ProGeM is freely available via GitHub. Oxford University Press 2019-01-10 2018-09-20 /pmc/articles/PMC6326795/ /pubmed/30239796 http://dx.doi.org/10.1093/nar/gky837 Text en © The Author(s) 2018. Published by Oxford University Press on behalf of Nucleic Acids Research. http://creativecommons.org/licenses/by/4.0/ This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted reuse, distribution, and reproduction in any medium, provided the original work is properly cited. |
spellingShingle | Methods Online Stacey, David Fauman, Eric B Ziemek, Daniel Sun, Benjamin B Harshfield, Eric L Wood, Angela M Butterworth, Adam S Suhre, Karsten Paul, Dirk S ProGeM: a framework for the prioritization of candidate causal genes at molecular quantitative trait loci |
title | ProGeM: a framework for the prioritization of candidate causal genes at molecular quantitative trait loci |
title_full | ProGeM: a framework for the prioritization of candidate causal genes at molecular quantitative trait loci |
title_fullStr | ProGeM: a framework for the prioritization of candidate causal genes at molecular quantitative trait loci |
title_full_unstemmed | ProGeM: a framework for the prioritization of candidate causal genes at molecular quantitative trait loci |
title_short | ProGeM: a framework for the prioritization of candidate causal genes at molecular quantitative trait loci |
title_sort | progem: a framework for the prioritization of candidate causal genes at molecular quantitative trait loci |
topic | Methods Online |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6326795/ https://www.ncbi.nlm.nih.gov/pubmed/30239796 http://dx.doi.org/10.1093/nar/gky837 |
work_keys_str_mv | AT staceydavid progemaframeworkfortheprioritizationofcandidatecausalgenesatmolecularquantitativetraitloci AT faumanericb progemaframeworkfortheprioritizationofcandidatecausalgenesatmolecularquantitativetraitloci AT ziemekdaniel progemaframeworkfortheprioritizationofcandidatecausalgenesatmolecularquantitativetraitloci AT sunbenjaminb progemaframeworkfortheprioritizationofcandidatecausalgenesatmolecularquantitativetraitloci AT harshfieldericl progemaframeworkfortheprioritizationofcandidatecausalgenesatmolecularquantitativetraitloci AT woodangelam progemaframeworkfortheprioritizationofcandidatecausalgenesatmolecularquantitativetraitloci AT butterworthadams progemaframeworkfortheprioritizationofcandidatecausalgenesatmolecularquantitativetraitloci AT suhrekarsten progemaframeworkfortheprioritizationofcandidatecausalgenesatmolecularquantitativetraitloci AT pauldirks progemaframeworkfortheprioritizationofcandidatecausalgenesatmolecularquantitativetraitloci |