Cargando…

TFpredict and SABINE: Sequence-Based Prediction of Structural and Functional Characteristics of Transcription Factors

One of the key mechanisms of transcriptional control are the specific connections between transcription factors (TF) and cis-regulatory elements in gene promoters. The elucidation of these specific protein-DNA interactions is crucial to gain insights into the complex regulatory mechanisms and networ...

Descripción completa

Detalles Bibliográficos
Autores principales: Eichner, Johannes, Topf, Florian, Dräger, Andreas, Wrzodek, Clemens, Wanke, Dierk, Zell, Andreas
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Public Library of Science 2013
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3861411/
https://www.ncbi.nlm.nih.gov/pubmed/24349230
http://dx.doi.org/10.1371/journal.pone.0082238
_version_ 1782295629923352576
author Eichner, Johannes
Topf, Florian
Dräger, Andreas
Wrzodek, Clemens
Wanke, Dierk
Zell, Andreas
author_facet Eichner, Johannes
Topf, Florian
Dräger, Andreas
Wrzodek, Clemens
Wanke, Dierk
Zell, Andreas
author_sort Eichner, Johannes
collection PubMed
description One of the key mechanisms of transcriptional control are the specific connections between transcription factors (TF) and cis-regulatory elements in gene promoters. The elucidation of these specific protein-DNA interactions is crucial to gain insights into the complex regulatory mechanisms and networks underlying the adaptation of organisms to dynamically changing environmental conditions. As experimental techniques for determining TF binding sites are expensive and mostly performed for selected TFs only, accurate computational approaches are needed to analyze transcriptional regulation in eukaryotes on a genome-wide level. We implemented a four-step classification workflow which for a given protein sequence (1) discriminates TFs from other proteins, (2) determines the structural superclass of TFs, (3) identifies the DNA-binding domains of TFs and (4) predicts their cis-acting DNA motif. While existing tools were extended and adapted for performing the latter two prediction steps, the first two steps are based on a novel numeric sequence representation which allows for combining existing knowledge from a BLAST scan with robust machine learning-based classification. By evaluation on a set of experimentally confirmed TFs and non-TFs, we demonstrate that our new protein sequence representation facilitates more reliable identification and structural classification of TFs than previously proposed sequence-derived features. The algorithms underlying our proposed methodology are implemented in the two complementary tools TFpredict and SABINE. The online and stand-alone versions of TFpredict and SABINE are freely available to academics at http://www.cogsys.cs.uni-tuebingen.de/software/TFpredict/ and http://www.cogsys.cs.uni-tuebingen.de/software/SABINE/.
format Online
Article
Text
id pubmed-3861411
institution National Center for Biotechnology Information
language English
publishDate 2013
publisher Public Library of Science
record_format MEDLINE/PubMed
spelling pubmed-38614112013-12-17 TFpredict and SABINE: Sequence-Based Prediction of Structural and Functional Characteristics of Transcription Factors Eichner, Johannes Topf, Florian Dräger, Andreas Wrzodek, Clemens Wanke, Dierk Zell, Andreas PLoS One Research Article One of the key mechanisms of transcriptional control are the specific connections between transcription factors (TF) and cis-regulatory elements in gene promoters. The elucidation of these specific protein-DNA interactions is crucial to gain insights into the complex regulatory mechanisms and networks underlying the adaptation of organisms to dynamically changing environmental conditions. As experimental techniques for determining TF binding sites are expensive and mostly performed for selected TFs only, accurate computational approaches are needed to analyze transcriptional regulation in eukaryotes on a genome-wide level. We implemented a four-step classification workflow which for a given protein sequence (1) discriminates TFs from other proteins, (2) determines the structural superclass of TFs, (3) identifies the DNA-binding domains of TFs and (4) predicts their cis-acting DNA motif. While existing tools were extended and adapted for performing the latter two prediction steps, the first two steps are based on a novel numeric sequence representation which allows for combining existing knowledge from a BLAST scan with robust machine learning-based classification. By evaluation on a set of experimentally confirmed TFs and non-TFs, we demonstrate that our new protein sequence representation facilitates more reliable identification and structural classification of TFs than previously proposed sequence-derived features. The algorithms underlying our proposed methodology are implemented in the two complementary tools TFpredict and SABINE. The online and stand-alone versions of TFpredict and SABINE are freely available to academics at http://www.cogsys.cs.uni-tuebingen.de/software/TFpredict/ and http://www.cogsys.cs.uni-tuebingen.de/software/SABINE/. Public Library of Science 2013-12-12 /pmc/articles/PMC3861411/ /pubmed/24349230 http://dx.doi.org/10.1371/journal.pone.0082238 Text en © 2013 Eichner et al http://creativecommons.org/licenses/by/4.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are properly credited.
spellingShingle Research Article
Eichner, Johannes
Topf, Florian
Dräger, Andreas
Wrzodek, Clemens
Wanke, Dierk
Zell, Andreas
TFpredict and SABINE: Sequence-Based Prediction of Structural and Functional Characteristics of Transcription Factors
title TFpredict and SABINE: Sequence-Based Prediction of Structural and Functional Characteristics of Transcription Factors
title_full TFpredict and SABINE: Sequence-Based Prediction of Structural and Functional Characteristics of Transcription Factors
title_fullStr TFpredict and SABINE: Sequence-Based Prediction of Structural and Functional Characteristics of Transcription Factors
title_full_unstemmed TFpredict and SABINE: Sequence-Based Prediction of Structural and Functional Characteristics of Transcription Factors
title_short TFpredict and SABINE: Sequence-Based Prediction of Structural and Functional Characteristics of Transcription Factors
title_sort tfpredict and sabine: sequence-based prediction of structural and functional characteristics of transcription factors
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3861411/
https://www.ncbi.nlm.nih.gov/pubmed/24349230
http://dx.doi.org/10.1371/journal.pone.0082238
work_keys_str_mv AT eichnerjohannes tfpredictandsabinesequencebasedpredictionofstructuralandfunctionalcharacteristicsoftranscriptionfactors
AT topfflorian tfpredictandsabinesequencebasedpredictionofstructuralandfunctionalcharacteristicsoftranscriptionfactors
AT dragerandreas tfpredictandsabinesequencebasedpredictionofstructuralandfunctionalcharacteristicsoftranscriptionfactors
AT wrzodekclemens tfpredictandsabinesequencebasedpredictionofstructuralandfunctionalcharacteristicsoftranscriptionfactors
AT wankedierk tfpredictandsabinesequencebasedpredictionofstructuralandfunctionalcharacteristicsoftranscriptionfactors
AT zellandreas tfpredictandsabinesequencebasedpredictionofstructuralandfunctionalcharacteristicsoftranscriptionfactors