Cargando…

Learning a Prior on Regulatory Potential from eQTL Data

Genome-wide RNA expression data provide a detailed view of an organism's biological state; hence, a dataset measuring expression variation between genetically diverse individuals (eQTL data) may provide important insights into the genetics of complex traits. However, with data from a relatively...

Descripción completa

Detalles Bibliográficos
Autores principales: Lee, Su-In, Dudley, Aimée M., Drubin, David, Silver, Pamela A., Krogan, Nevan J., Pe'er, Dana, Koller, Daphne
Formato: Texto
Lenguaje:English
Publicado: Public Library of Science 2009
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2627940/
https://www.ncbi.nlm.nih.gov/pubmed/19180192
http://dx.doi.org/10.1371/journal.pgen.1000358
_version_ 1782163624619409408
author Lee, Su-In
Dudley, Aimée M.
Drubin, David
Silver, Pamela A.
Krogan, Nevan J.
Pe'er, Dana
Koller, Daphne
author_facet Lee, Su-In
Dudley, Aimée M.
Drubin, David
Silver, Pamela A.
Krogan, Nevan J.
Pe'er, Dana
Koller, Daphne
author_sort Lee, Su-In
collection PubMed
description Genome-wide RNA expression data provide a detailed view of an organism's biological state; hence, a dataset measuring expression variation between genetically diverse individuals (eQTL data) may provide important insights into the genetics of complex traits. However, with data from a relatively small number of individuals, it is difficult to distinguish true causal polymorphisms from the large number of possibilities. The problem is particularly challenging in populations with significant linkage disequilibrium, where traits are often linked to large chromosomal regions containing many genes. Here, we present a novel method, Lirnet, that automatically learns a regulatory potential for each sequence polymorphism, estimating how likely it is to have a significant effect on gene expression. This regulatory potential is defined in terms of “regulatory features”—including the function of the gene and the conservation, type, and position of genetic polymorphisms—that are available for any organism. The extent to which the different features influence the regulatory potential is learned automatically, making Lirnet readily applicable to different datasets, organisms, and feature sets. We apply Lirnet both to the human HapMap eQTL dataset and to a yeast eQTL dataset and provide statistical and biological results demonstrating that Lirnet produces significantly better regulatory programs than other recent approaches. We demonstrate in the yeast data that Lirnet can correctly suggest a specific causal sequence variation within a large, linked chromosomal region. In one example, Lirnet uncovered a novel, experimentally validated connection between Puf3—a sequence-specific RNA binding protein—and P-bodies—cytoplasmic structures that regulate translation and RNA stability—as well as the particular causative polymorphism, a SNP in Mkt1, that induces the variation in the pathway.
format Text
id pubmed-2627940
institution National Center for Biotechnology Information
language English
publishDate 2009
publisher Public Library of Science
record_format MEDLINE/PubMed
spelling pubmed-26279402009-01-30 Learning a Prior on Regulatory Potential from eQTL Data Lee, Su-In Dudley, Aimée M. Drubin, David Silver, Pamela A. Krogan, Nevan J. Pe'er, Dana Koller, Daphne PLoS Genet Research Article Genome-wide RNA expression data provide a detailed view of an organism's biological state; hence, a dataset measuring expression variation between genetically diverse individuals (eQTL data) may provide important insights into the genetics of complex traits. However, with data from a relatively small number of individuals, it is difficult to distinguish true causal polymorphisms from the large number of possibilities. The problem is particularly challenging in populations with significant linkage disequilibrium, where traits are often linked to large chromosomal regions containing many genes. Here, we present a novel method, Lirnet, that automatically learns a regulatory potential for each sequence polymorphism, estimating how likely it is to have a significant effect on gene expression. This regulatory potential is defined in terms of “regulatory features”—including the function of the gene and the conservation, type, and position of genetic polymorphisms—that are available for any organism. The extent to which the different features influence the regulatory potential is learned automatically, making Lirnet readily applicable to different datasets, organisms, and feature sets. We apply Lirnet both to the human HapMap eQTL dataset and to a yeast eQTL dataset and provide statistical and biological results demonstrating that Lirnet produces significantly better regulatory programs than other recent approaches. We demonstrate in the yeast data that Lirnet can correctly suggest a specific causal sequence variation within a large, linked chromosomal region. In one example, Lirnet uncovered a novel, experimentally validated connection between Puf3—a sequence-specific RNA binding protein—and P-bodies—cytoplasmic structures that regulate translation and RNA stability—as well as the particular causative polymorphism, a SNP in Mkt1, that induces the variation in the pathway. Public Library of Science 2009-01-30 /pmc/articles/PMC2627940/ /pubmed/19180192 http://dx.doi.org/10.1371/journal.pgen.1000358 Text en Lee et al. http://creativecommons.org/licenses/by/4.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are properly credited.
spellingShingle Research Article
Lee, Su-In
Dudley, Aimée M.
Drubin, David
Silver, Pamela A.
Krogan, Nevan J.
Pe'er, Dana
Koller, Daphne
Learning a Prior on Regulatory Potential from eQTL Data
title Learning a Prior on Regulatory Potential from eQTL Data
title_full Learning a Prior on Regulatory Potential from eQTL Data
title_fullStr Learning a Prior on Regulatory Potential from eQTL Data
title_full_unstemmed Learning a Prior on Regulatory Potential from eQTL Data
title_short Learning a Prior on Regulatory Potential from eQTL Data
title_sort learning a prior on regulatory potential from eqtl data
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2627940/
https://www.ncbi.nlm.nih.gov/pubmed/19180192
http://dx.doi.org/10.1371/journal.pgen.1000358
work_keys_str_mv AT leesuin learningaprioronregulatorypotentialfromeqtldata
AT dudleyaimeem learningaprioronregulatorypotentialfromeqtldata
AT drubindavid learningaprioronregulatorypotentialfromeqtldata
AT silverpamelaa learningaprioronregulatorypotentialfromeqtldata
AT krogannevanj learningaprioronregulatorypotentialfromeqtldata
AT peerdana learningaprioronregulatorypotentialfromeqtldata
AT kollerdaphne learningaprioronregulatorypotentialfromeqtldata