Cargando…
Use of structural DNA properties for the prediction of transcription-factor binding sites in Escherichia coli
Recognition of genomic binding sites by transcription factors can occur through base-specific recognition, or by recognition of variations within the structure of the DNA macromolecule. In this article, we investigate what information can be retrieved from local DNA structural properties that is rel...
Autores principales: | , , , , , , |
---|---|
Formato: | Texto |
Lenguaje: | English |
Publicado: |
Oxford University Press
2011
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3025552/ https://www.ncbi.nlm.nih.gov/pubmed/21051340 http://dx.doi.org/10.1093/nar/gkq1071 |
_version_ | 1782196921608175616 |
---|---|
author | Meysman, Pieter Dang, Thanh Hai Laukens, Kris De Smet, Riet Wu, Yan Marchal, Kathleen Engelen, Kristof |
author_facet | Meysman, Pieter Dang, Thanh Hai Laukens, Kris De Smet, Riet Wu, Yan Marchal, Kathleen Engelen, Kristof |
author_sort | Meysman, Pieter |
collection | PubMed |
description | Recognition of genomic binding sites by transcription factors can occur through base-specific recognition, or by recognition of variations within the structure of the DNA macromolecule. In this article, we investigate what information can be retrieved from local DNA structural properties that is relevant to transcription factor binding and that cannot be captured by the nucleotide sequence alone. More specifically, we explore the benefit of employing the structural characteristics of DNA to create binding-site models that encompass indirect recognition for the Escherichia coli model organism. We developed a novel methodology [Conditional Random fields of Smoothed Structural Data (CRoSSeD)], based on structural scales and conditional random fields to model and predict regulator binding sites. The value of relying on local structural-DNA properties is demonstrated by improved classifier performance on a large number of biological datasets, and by the detection of novel binding sites which could be validated by independent data sources, and which could not be identified using sequence data alone. We further show that the CRoSSeD-binding-site models can be related to the actual molecular mechanisms of the transcription factor DNA binding, and thus cannot only be used for prediction of novel sites, but might also give valuable insights into unknown binding mechanisms of transcription factors. |
format | Text |
id | pubmed-3025552 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2011 |
publisher | Oxford University Press |
record_format | MEDLINE/PubMed |
spelling | pubmed-30255522011-01-24 Use of structural DNA properties for the prediction of transcription-factor binding sites in Escherichia coli Meysman, Pieter Dang, Thanh Hai Laukens, Kris De Smet, Riet Wu, Yan Marchal, Kathleen Engelen, Kristof Nucleic Acids Res Methods Online Recognition of genomic binding sites by transcription factors can occur through base-specific recognition, or by recognition of variations within the structure of the DNA macromolecule. In this article, we investigate what information can be retrieved from local DNA structural properties that is relevant to transcription factor binding and that cannot be captured by the nucleotide sequence alone. More specifically, we explore the benefit of employing the structural characteristics of DNA to create binding-site models that encompass indirect recognition for the Escherichia coli model organism. We developed a novel methodology [Conditional Random fields of Smoothed Structural Data (CRoSSeD)], based on structural scales and conditional random fields to model and predict regulator binding sites. The value of relying on local structural-DNA properties is demonstrated by improved classifier performance on a large number of biological datasets, and by the detection of novel binding sites which could be validated by independent data sources, and which could not be identified using sequence data alone. We further show that the CRoSSeD-binding-site models can be related to the actual molecular mechanisms of the transcription factor DNA binding, and thus cannot only be used for prediction of novel sites, but might also give valuable insights into unknown binding mechanisms of transcription factors. Oxford University Press 2011-01 2010-11-04 /pmc/articles/PMC3025552/ /pubmed/21051340 http://dx.doi.org/10.1093/nar/gkq1071 Text en © The Author(s) 2010. Published by Oxford University Press. http://creativecommons.org/licenses/by-nc/2.5 This is an Open Access article distributed under the terms of the Creative Commons Attribution Non-Commercial License (http://creativecommons.org/licenses/by-nc/2.5), which permits unrestricted non-commercial use, distribution, and reproduction in any medium, provided the original work is properly cited. |
spellingShingle | Methods Online Meysman, Pieter Dang, Thanh Hai Laukens, Kris De Smet, Riet Wu, Yan Marchal, Kathleen Engelen, Kristof Use of structural DNA properties for the prediction of transcription-factor binding sites in Escherichia coli |
title | Use of structural DNA properties for the prediction of transcription-factor binding sites in Escherichia coli |
title_full | Use of structural DNA properties for the prediction of transcription-factor binding sites in Escherichia coli |
title_fullStr | Use of structural DNA properties for the prediction of transcription-factor binding sites in Escherichia coli |
title_full_unstemmed | Use of structural DNA properties for the prediction of transcription-factor binding sites in Escherichia coli |
title_short | Use of structural DNA properties for the prediction of transcription-factor binding sites in Escherichia coli |
title_sort | use of structural dna properties for the prediction of transcription-factor binding sites in escherichia coli |
topic | Methods Online |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3025552/ https://www.ncbi.nlm.nih.gov/pubmed/21051340 http://dx.doi.org/10.1093/nar/gkq1071 |
work_keys_str_mv | AT meysmanpieter useofstructuraldnapropertiesforthepredictionoftranscriptionfactorbindingsitesinescherichiacoli AT dangthanhhai useofstructuraldnapropertiesforthepredictionoftranscriptionfactorbindingsitesinescherichiacoli AT laukenskris useofstructuraldnapropertiesforthepredictionoftranscriptionfactorbindingsitesinescherichiacoli AT desmetriet useofstructuraldnapropertiesforthepredictionoftranscriptionfactorbindingsitesinescherichiacoli AT wuyan useofstructuraldnapropertiesforthepredictionoftranscriptionfactorbindingsitesinescherichiacoli AT marchalkathleen useofstructuraldnapropertiesforthepredictionoftranscriptionfactorbindingsitesinescherichiacoli AT engelenkristof useofstructuraldnapropertiesforthepredictionoftranscriptionfactorbindingsitesinescherichiacoli |