Cargando…

Sequence signatures extracted from proximal promoters can be used to predict distal enhancers

BACKGROUND: Gene expression is controlled by proximal promoters and distal regulatory elements such as enhancers. While the activity of some promoters can be invariant across tissues, enhancers tend to be highly tissue-specific. RESULTS: We compiled sets of tissue-specific promoters based on gene ex...

Descripción completa

Detalles Bibliográficos
Autores principales: Taher, Leila, Smith, Robin P, Kim, Mee J, Ahituv, Nadav, Ovcharenko, Ivan
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2013
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3983659/
https://www.ncbi.nlm.nih.gov/pubmed/24156763
http://dx.doi.org/10.1186/gb-2013-14-10-r117
_version_ 1782311355060060160
author Taher, Leila
Smith, Robin P
Kim, Mee J
Ahituv, Nadav
Ovcharenko, Ivan
author_facet Taher, Leila
Smith, Robin P
Kim, Mee J
Ahituv, Nadav
Ovcharenko, Ivan
author_sort Taher, Leila
collection PubMed
description BACKGROUND: Gene expression is controlled by proximal promoters and distal regulatory elements such as enhancers. While the activity of some promoters can be invariant across tissues, enhancers tend to be highly tissue-specific. RESULTS: We compiled sets of tissue-specific promoters based on gene expression profiles of 79 human tissues and cell types. Putative transcription factor binding sites within each set of sequences were used to train a support vector machine classifier capable of distinguishing tissue-specific promoters from control sequences. We obtained reliable classifiers for 92% of the tissues, with an area under the receiver operating characteristic curve between 60% (for subthalamic nucleus promoters) and 98% (for heart promoters). We next used these classifiers to identify tissue-specific enhancers, scanning distal non-coding sequences in the loci of the 200 most highly and lowly expressed genes. Thirty percent of reliable classifiers produced consistent enhancer predictions, with significantly higher densities in the loci of the most highly expressed compared to lowly expressed genes. Liver enhancer predictions were assessed in vivo using the hydrodynamic tail vein injection assay. Fifty-eight percent of the predictions yielded significant enhancer activity in the mouse liver, whereas a control set of five sequences was completely negative. CONCLUSIONS: We conclude that promoters of tissue-specific genes often contain unambiguous tissue-specific signatures that can be learned and used for the de novo prediction of enhancers.
format Online
Article
Text
id pubmed-3983659
institution National Center for Biotechnology Information
language English
publishDate 2013
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-39836592014-04-25 Sequence signatures extracted from proximal promoters can be used to predict distal enhancers Taher, Leila Smith, Robin P Kim, Mee J Ahituv, Nadav Ovcharenko, Ivan Genome Biol Research BACKGROUND: Gene expression is controlled by proximal promoters and distal regulatory elements such as enhancers. While the activity of some promoters can be invariant across tissues, enhancers tend to be highly tissue-specific. RESULTS: We compiled sets of tissue-specific promoters based on gene expression profiles of 79 human tissues and cell types. Putative transcription factor binding sites within each set of sequences were used to train a support vector machine classifier capable of distinguishing tissue-specific promoters from control sequences. We obtained reliable classifiers for 92% of the tissues, with an area under the receiver operating characteristic curve between 60% (for subthalamic nucleus promoters) and 98% (for heart promoters). We next used these classifiers to identify tissue-specific enhancers, scanning distal non-coding sequences in the loci of the 200 most highly and lowly expressed genes. Thirty percent of reliable classifiers produced consistent enhancer predictions, with significantly higher densities in the loci of the most highly expressed compared to lowly expressed genes. Liver enhancer predictions were assessed in vivo using the hydrodynamic tail vein injection assay. Fifty-eight percent of the predictions yielded significant enhancer activity in the mouse liver, whereas a control set of five sequences was completely negative. CONCLUSIONS: We conclude that promoters of tissue-specific genes often contain unambiguous tissue-specific signatures that can be learned and used for the de novo prediction of enhancers. BioMed Central 2013 2013-10-24 /pmc/articles/PMC3983659/ /pubmed/24156763 http://dx.doi.org/10.1186/gb-2013-14-10-r117 Text en Copyright © 2013 Taher et al.; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an open access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Research
Taher, Leila
Smith, Robin P
Kim, Mee J
Ahituv, Nadav
Ovcharenko, Ivan
Sequence signatures extracted from proximal promoters can be used to predict distal enhancers
title Sequence signatures extracted from proximal promoters can be used to predict distal enhancers
title_full Sequence signatures extracted from proximal promoters can be used to predict distal enhancers
title_fullStr Sequence signatures extracted from proximal promoters can be used to predict distal enhancers
title_full_unstemmed Sequence signatures extracted from proximal promoters can be used to predict distal enhancers
title_short Sequence signatures extracted from proximal promoters can be used to predict distal enhancers
title_sort sequence signatures extracted from proximal promoters can be used to predict distal enhancers
topic Research
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3983659/
https://www.ncbi.nlm.nih.gov/pubmed/24156763
http://dx.doi.org/10.1186/gb-2013-14-10-r117
work_keys_str_mv AT taherleila sequencesignaturesextractedfromproximalpromoterscanbeusedtopredictdistalenhancers
AT smithrobinp sequencesignaturesextractedfromproximalpromoterscanbeusedtopredictdistalenhancers
AT kimmeej sequencesignaturesextractedfromproximalpromoterscanbeusedtopredictdistalenhancers
AT ahituvnadav sequencesignaturesextractedfromproximalpromoterscanbeusedtopredictdistalenhancers
AT ovcharenkoivan sequencesignaturesextractedfromproximalpromoterscanbeusedtopredictdistalenhancers