Cargando…

Reconstruction of novel transcription factor regulons through inference of their binding sites

BACKGROUND: In most sequenced organisms the number of known regulatory genes (e.g., transcription factors (TFs)) vastly exceeds the number of experimentally-verified regulons that could be associated with them. At present, identification of TF regulons is mostly done through comparative genomics app...

Descripción completa

Detalles Bibliográficos
Autores principales: Elmas, Abdulkadir, Wang, Xiaodong, Samoilov, Michael S.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2015
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4576408/
https://www.ncbi.nlm.nih.gov/pubmed/26388177
http://dx.doi.org/10.1186/s12859-015-0685-y
_version_ 1782390868880130048
author Elmas, Abdulkadir
Wang, Xiaodong
Samoilov, Michael S.
author_facet Elmas, Abdulkadir
Wang, Xiaodong
Samoilov, Michael S.
author_sort Elmas, Abdulkadir
collection PubMed
description BACKGROUND: In most sequenced organisms the number of known regulatory genes (e.g., transcription factors (TFs)) vastly exceeds the number of experimentally-verified regulons that could be associated with them. At present, identification of TF regulons is mostly done through comparative genomics approaches. Such methods could miss organism-specific regulatory interactions and often require expensive and time-consuming experimental techniques to generate the underlying data. RESULTS: In this work, we present an efficient algorithm that aims to identify a given transcription factor’s regulon through inference of its unknown binding sites, based on the discovery of its binding motif. The proposed approach relies on computational methods that utilize gene expression data sets and knockout fitness data sets which are available or may be straightforwardly obtained for many organisms. We computationally constructed the profiles of putative regulons for the TFs LexA, PurR and Fur in E. coli K12 and identified their binding motifs. Comparisons with an experimentally-verified database showed high recovery rates of the known regulon members, and indicated good predictions for the newly found genes with high biological significance. The proposed approach is also applicable to novel organisms for predicting unknown regulons of the transcriptional regulators. Results for the hypothetical protein D d e0289 in D. alaskensis include the discovery of a Fis-type TF binding motif. CONCLUSIONS: The proposed motif-based regulon inference approach can discover the organism-specific regulatory interactions on a single genome, which may be missed by current comparative genomics techniques due to their limitations. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (doi:10.1186/s12859-015-0685-y) contains supplementary material, which is available to authorized users.
format Online
Article
Text
id pubmed-4576408
institution National Center for Biotechnology Information
language English
publishDate 2015
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-45764082015-09-22 Reconstruction of novel transcription factor regulons through inference of their binding sites Elmas, Abdulkadir Wang, Xiaodong Samoilov, Michael S. BMC Bioinformatics Methodology Article BACKGROUND: In most sequenced organisms the number of known regulatory genes (e.g., transcription factors (TFs)) vastly exceeds the number of experimentally-verified regulons that could be associated with them. At present, identification of TF regulons is mostly done through comparative genomics approaches. Such methods could miss organism-specific regulatory interactions and often require expensive and time-consuming experimental techniques to generate the underlying data. RESULTS: In this work, we present an efficient algorithm that aims to identify a given transcription factor’s regulon through inference of its unknown binding sites, based on the discovery of its binding motif. The proposed approach relies on computational methods that utilize gene expression data sets and knockout fitness data sets which are available or may be straightforwardly obtained for many organisms. We computationally constructed the profiles of putative regulons for the TFs LexA, PurR and Fur in E. coli K12 and identified their binding motifs. Comparisons with an experimentally-verified database showed high recovery rates of the known regulon members, and indicated good predictions for the newly found genes with high biological significance. The proposed approach is also applicable to novel organisms for predicting unknown regulons of the transcriptional regulators. Results for the hypothetical protein D d e0289 in D. alaskensis include the discovery of a Fis-type TF binding motif. CONCLUSIONS: The proposed motif-based regulon inference approach can discover the organism-specific regulatory interactions on a single genome, which may be missed by current comparative genomics techniques due to their limitations. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (doi:10.1186/s12859-015-0685-y) contains supplementary material, which is available to authorized users. BioMed Central 2015-09-21 /pmc/articles/PMC4576408/ /pubmed/26388177 http://dx.doi.org/10.1186/s12859-015-0685-y Text en © Elmas et al. 2015 Open Access This article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.
spellingShingle Methodology Article
Elmas, Abdulkadir
Wang, Xiaodong
Samoilov, Michael S.
Reconstruction of novel transcription factor regulons through inference of their binding sites
title Reconstruction of novel transcription factor regulons through inference of their binding sites
title_full Reconstruction of novel transcription factor regulons through inference of their binding sites
title_fullStr Reconstruction of novel transcription factor regulons through inference of their binding sites
title_full_unstemmed Reconstruction of novel transcription factor regulons through inference of their binding sites
title_short Reconstruction of novel transcription factor regulons through inference of their binding sites
title_sort reconstruction of novel transcription factor regulons through inference of their binding sites
topic Methodology Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4576408/
https://www.ncbi.nlm.nih.gov/pubmed/26388177
http://dx.doi.org/10.1186/s12859-015-0685-y
work_keys_str_mv AT elmasabdulkadir reconstructionofnoveltranscriptionfactorregulonsthroughinferenceoftheirbindingsites
AT wangxiaodong reconstructionofnoveltranscriptionfactorregulonsthroughinferenceoftheirbindingsites
AT samoilovmichaels reconstructionofnoveltranscriptionfactorregulonsthroughinferenceoftheirbindingsites