Cargando…
TFpredict and SABINE: Sequence-Based Prediction of Structural and Functional Characteristics of Transcription Factors
One of the key mechanisms of transcriptional control are the specific connections between transcription factors (TF) and cis-regulatory elements in gene promoters. The elucidation of these specific protein-DNA interactions is crucial to gain insights into the complex regulatory mechanisms and networ...
Autores principales: | , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Public Library of Science
2013
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3861411/ https://www.ncbi.nlm.nih.gov/pubmed/24349230 http://dx.doi.org/10.1371/journal.pone.0082238 |
_version_ | 1782295629923352576 |
---|---|
author | Eichner, Johannes Topf, Florian Dräger, Andreas Wrzodek, Clemens Wanke, Dierk Zell, Andreas |
author_facet | Eichner, Johannes Topf, Florian Dräger, Andreas Wrzodek, Clemens Wanke, Dierk Zell, Andreas |
author_sort | Eichner, Johannes |
collection | PubMed |
description | One of the key mechanisms of transcriptional control are the specific connections between transcription factors (TF) and cis-regulatory elements in gene promoters. The elucidation of these specific protein-DNA interactions is crucial to gain insights into the complex regulatory mechanisms and networks underlying the adaptation of organisms to dynamically changing environmental conditions. As experimental techniques for determining TF binding sites are expensive and mostly performed for selected TFs only, accurate computational approaches are needed to analyze transcriptional regulation in eukaryotes on a genome-wide level. We implemented a four-step classification workflow which for a given protein sequence (1) discriminates TFs from other proteins, (2) determines the structural superclass of TFs, (3) identifies the DNA-binding domains of TFs and (4) predicts their cis-acting DNA motif. While existing tools were extended and adapted for performing the latter two prediction steps, the first two steps are based on a novel numeric sequence representation which allows for combining existing knowledge from a BLAST scan with robust machine learning-based classification. By evaluation on a set of experimentally confirmed TFs and non-TFs, we demonstrate that our new protein sequence representation facilitates more reliable identification and structural classification of TFs than previously proposed sequence-derived features. The algorithms underlying our proposed methodology are implemented in the two complementary tools TFpredict and SABINE. The online and stand-alone versions of TFpredict and SABINE are freely available to academics at http://www.cogsys.cs.uni-tuebingen.de/software/TFpredict/ and http://www.cogsys.cs.uni-tuebingen.de/software/SABINE/. |
format | Online Article Text |
id | pubmed-3861411 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2013 |
publisher | Public Library of Science |
record_format | MEDLINE/PubMed |
spelling | pubmed-38614112013-12-17 TFpredict and SABINE: Sequence-Based Prediction of Structural and Functional Characteristics of Transcription Factors Eichner, Johannes Topf, Florian Dräger, Andreas Wrzodek, Clemens Wanke, Dierk Zell, Andreas PLoS One Research Article One of the key mechanisms of transcriptional control are the specific connections between transcription factors (TF) and cis-regulatory elements in gene promoters. The elucidation of these specific protein-DNA interactions is crucial to gain insights into the complex regulatory mechanisms and networks underlying the adaptation of organisms to dynamically changing environmental conditions. As experimental techniques for determining TF binding sites are expensive and mostly performed for selected TFs only, accurate computational approaches are needed to analyze transcriptional regulation in eukaryotes on a genome-wide level. We implemented a four-step classification workflow which for a given protein sequence (1) discriminates TFs from other proteins, (2) determines the structural superclass of TFs, (3) identifies the DNA-binding domains of TFs and (4) predicts their cis-acting DNA motif. While existing tools were extended and adapted for performing the latter two prediction steps, the first two steps are based on a novel numeric sequence representation which allows for combining existing knowledge from a BLAST scan with robust machine learning-based classification. By evaluation on a set of experimentally confirmed TFs and non-TFs, we demonstrate that our new protein sequence representation facilitates more reliable identification and structural classification of TFs than previously proposed sequence-derived features. The algorithms underlying our proposed methodology are implemented in the two complementary tools TFpredict and SABINE. The online and stand-alone versions of TFpredict and SABINE are freely available to academics at http://www.cogsys.cs.uni-tuebingen.de/software/TFpredict/ and http://www.cogsys.cs.uni-tuebingen.de/software/SABINE/. Public Library of Science 2013-12-12 /pmc/articles/PMC3861411/ /pubmed/24349230 http://dx.doi.org/10.1371/journal.pone.0082238 Text en © 2013 Eichner et al http://creativecommons.org/licenses/by/4.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are properly credited. |
spellingShingle | Research Article Eichner, Johannes Topf, Florian Dräger, Andreas Wrzodek, Clemens Wanke, Dierk Zell, Andreas TFpredict and SABINE: Sequence-Based Prediction of Structural and Functional Characteristics of Transcription Factors |
title | TFpredict and SABINE: Sequence-Based Prediction of Structural and Functional Characteristics of Transcription Factors |
title_full | TFpredict and SABINE: Sequence-Based Prediction of Structural and Functional Characteristics of Transcription Factors |
title_fullStr | TFpredict and SABINE: Sequence-Based Prediction of Structural and Functional Characteristics of Transcription Factors |
title_full_unstemmed | TFpredict and SABINE: Sequence-Based Prediction of Structural and Functional Characteristics of Transcription Factors |
title_short | TFpredict and SABINE: Sequence-Based Prediction of Structural and Functional Characteristics of Transcription Factors |
title_sort | tfpredict and sabine: sequence-based prediction of structural and functional characteristics of transcription factors |
topic | Research Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3861411/ https://www.ncbi.nlm.nih.gov/pubmed/24349230 http://dx.doi.org/10.1371/journal.pone.0082238 |
work_keys_str_mv | AT eichnerjohannes tfpredictandsabinesequencebasedpredictionofstructuralandfunctionalcharacteristicsoftranscriptionfactors AT topfflorian tfpredictandsabinesequencebasedpredictionofstructuralandfunctionalcharacteristicsoftranscriptionfactors AT dragerandreas tfpredictandsabinesequencebasedpredictionofstructuralandfunctionalcharacteristicsoftranscriptionfactors AT wrzodekclemens tfpredictandsabinesequencebasedpredictionofstructuralandfunctionalcharacteristicsoftranscriptionfactors AT wankedierk tfpredictandsabinesequencebasedpredictionofstructuralandfunctionalcharacteristicsoftranscriptionfactors AT zellandreas tfpredictandsabinesequencebasedpredictionofstructuralandfunctionalcharacteristicsoftranscriptionfactors |