Cargando…

Use of structural DNA properties for the prediction of transcription-factor binding sites in Escherichia coli

Recognition of genomic binding sites by transcription factors can occur through base-specific recognition, or by recognition of variations within the structure of the DNA macromolecule. In this article, we investigate what information can be retrieved from local DNA structural properties that is rel...

Descripción completa

Detalles Bibliográficos
Autores principales: Meysman, Pieter, Dang, Thanh Hai, Laukens, Kris, De Smet, Riet, Wu, Yan, Marchal, Kathleen, Engelen, Kristof
Formato: Texto
Lenguaje:English
Publicado: Oxford University Press 2011
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3025552/
https://www.ncbi.nlm.nih.gov/pubmed/21051340
http://dx.doi.org/10.1093/nar/gkq1071
_version_ 1782196921608175616
author Meysman, Pieter
Dang, Thanh Hai
Laukens, Kris
De Smet, Riet
Wu, Yan
Marchal, Kathleen
Engelen, Kristof
author_facet Meysman, Pieter
Dang, Thanh Hai
Laukens, Kris
De Smet, Riet
Wu, Yan
Marchal, Kathleen
Engelen, Kristof
author_sort Meysman, Pieter
collection PubMed
description Recognition of genomic binding sites by transcription factors can occur through base-specific recognition, or by recognition of variations within the structure of the DNA macromolecule. In this article, we investigate what information can be retrieved from local DNA structural properties that is relevant to transcription factor binding and that cannot be captured by the nucleotide sequence alone. More specifically, we explore the benefit of employing the structural characteristics of DNA to create binding-site models that encompass indirect recognition for the Escherichia coli model organism. We developed a novel methodology [Conditional Random fields of Smoothed Structural Data (CRoSSeD)], based on structural scales and conditional random fields to model and predict regulator binding sites. The value of relying on local structural-DNA properties is demonstrated by improved classifier performance on a large number of biological datasets, and by the detection of novel binding sites which could be validated by independent data sources, and which could not be identified using sequence data alone. We further show that the CRoSSeD-binding-site models can be related to the actual molecular mechanisms of the transcription factor DNA binding, and thus cannot only be used for prediction of novel sites, but might also give valuable insights into unknown binding mechanisms of transcription factors.
format Text
id pubmed-3025552
institution National Center for Biotechnology Information
language English
publishDate 2011
publisher Oxford University Press
record_format MEDLINE/PubMed
spelling pubmed-30255522011-01-24 Use of structural DNA properties for the prediction of transcription-factor binding sites in Escherichia coli Meysman, Pieter Dang, Thanh Hai Laukens, Kris De Smet, Riet Wu, Yan Marchal, Kathleen Engelen, Kristof Nucleic Acids Res Methods Online Recognition of genomic binding sites by transcription factors can occur through base-specific recognition, or by recognition of variations within the structure of the DNA macromolecule. In this article, we investigate what information can be retrieved from local DNA structural properties that is relevant to transcription factor binding and that cannot be captured by the nucleotide sequence alone. More specifically, we explore the benefit of employing the structural characteristics of DNA to create binding-site models that encompass indirect recognition for the Escherichia coli model organism. We developed a novel methodology [Conditional Random fields of Smoothed Structural Data (CRoSSeD)], based on structural scales and conditional random fields to model and predict regulator binding sites. The value of relying on local structural-DNA properties is demonstrated by improved classifier performance on a large number of biological datasets, and by the detection of novel binding sites which could be validated by independent data sources, and which could not be identified using sequence data alone. We further show that the CRoSSeD-binding-site models can be related to the actual molecular mechanisms of the transcription factor DNA binding, and thus cannot only be used for prediction of novel sites, but might also give valuable insights into unknown binding mechanisms of transcription factors. Oxford University Press 2011-01 2010-11-04 /pmc/articles/PMC3025552/ /pubmed/21051340 http://dx.doi.org/10.1093/nar/gkq1071 Text en © The Author(s) 2010. Published by Oxford University Press. http://creativecommons.org/licenses/by-nc/2.5 This is an Open Access article distributed under the terms of the Creative Commons Attribution Non-Commercial License (http://creativecommons.org/licenses/by-nc/2.5), which permits unrestricted non-commercial use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Methods Online
Meysman, Pieter
Dang, Thanh Hai
Laukens, Kris
De Smet, Riet
Wu, Yan
Marchal, Kathleen
Engelen, Kristof
Use of structural DNA properties for the prediction of transcription-factor binding sites in Escherichia coli
title Use of structural DNA properties for the prediction of transcription-factor binding sites in Escherichia coli
title_full Use of structural DNA properties for the prediction of transcription-factor binding sites in Escherichia coli
title_fullStr Use of structural DNA properties for the prediction of transcription-factor binding sites in Escherichia coli
title_full_unstemmed Use of structural DNA properties for the prediction of transcription-factor binding sites in Escherichia coli
title_short Use of structural DNA properties for the prediction of transcription-factor binding sites in Escherichia coli
title_sort use of structural dna properties for the prediction of transcription-factor binding sites in escherichia coli
topic Methods Online
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3025552/
https://www.ncbi.nlm.nih.gov/pubmed/21051340
http://dx.doi.org/10.1093/nar/gkq1071
work_keys_str_mv AT meysmanpieter useofstructuraldnapropertiesforthepredictionoftranscriptionfactorbindingsitesinescherichiacoli
AT dangthanhhai useofstructuraldnapropertiesforthepredictionoftranscriptionfactorbindingsitesinescherichiacoli
AT laukenskris useofstructuraldnapropertiesforthepredictionoftranscriptionfactorbindingsitesinescherichiacoli
AT desmetriet useofstructuraldnapropertiesforthepredictionoftranscriptionfactorbindingsitesinescherichiacoli
AT wuyan useofstructuraldnapropertiesforthepredictionoftranscriptionfactorbindingsitesinescherichiacoli
AT marchalkathleen useofstructuraldnapropertiesforthepredictionoftranscriptionfactorbindingsitesinescherichiacoli
AT engelenkristof useofstructuraldnapropertiesforthepredictionoftranscriptionfactorbindingsitesinescherichiacoli