Cargando…

Evaluation of thresholds for the detection of binding sites for regulatory proteins in Escherichia coli K12 DNA

BACKGROUND: Sites in DNA that bind regulatory proteins can be detected computationally in various ways. Pattern discovery methods analyze collections of genes suspected to be co-regulated on the evidence, for example, of clustering of transcriptome data. Pattern searching methods use sequences with...

Descripción completa

Detalles Bibliográficos
Autores principales: Benítez-Bellón, Esperanza, Moreno-Hagelsieb, Gabriel, Collado-Vides, Julio
Formato: Texto
Lenguaje:English
Publicado: BioMed Central 2002
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC88811/
https://www.ncbi.nlm.nih.gov/pubmed/11897025
_version_ 1782120187780136960
author Benítez-Bellón, Esperanza
Moreno-Hagelsieb, Gabriel
Collado-Vides, Julio
author_facet Benítez-Bellón, Esperanza
Moreno-Hagelsieb, Gabriel
Collado-Vides, Julio
author_sort Benítez-Bellón, Esperanza
collection PubMed
description BACKGROUND: Sites in DNA that bind regulatory proteins can be detected computationally in various ways. Pattern discovery methods analyze collections of genes suspected to be co-regulated on the evidence, for example, of clustering of transcriptome data. Pattern searching methods use sequences with known binding sites to find other genes regulated by a given protein. Such computational methods are important strategies in the discovery and elaboration of regulatory networks and can provide the experimental biologist with a precise prediction of a binding site or identify a gene as a member of a set of co-regulated genes (a regulon). As more variations on such methods are published, however, thorough evaluation is necessary, as performance may differ depending on the conditions of use. Detailed evaluation also helps to improve and understand the behavior of the different methods and computational strategies. RESULTS: We used a collection of 86 regulons from Escherichia coli as datasets to evaluate two methods for pattern discovery and pattern searching: dyad analysis/dyad sweeping using the program Dyad-analysis, and multiple alignment using the programs Consensus/Patser. Clearly defined statistical parameters are used to evaluate the two methods in different situations. We placed particular emphasis on minimizing the rate of false positives. CONCLUSIONS: As a general rule, sensors obtained from experimentally reported binding sites in DNA frequently locate true sites as the highest-scoring sequences within a given upstream region, especially using Consensus/Patser. Pattern discovery is still an unsolved problem, although in the cases where Dyad-analysis finds significant dyads (around 50%), these frequently correspond to true binding sites. With more robust methods, regulatory predictions could help identify the function of unknown genes.
format Text
id pubmed-88811
institution National Center for Biotechnology Information
language English
publishDate 2002
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-888112002-03-18 Evaluation of thresholds for the detection of binding sites for regulatory proteins in Escherichia coli K12 DNA Benítez-Bellón, Esperanza Moreno-Hagelsieb, Gabriel Collado-Vides, Julio Genome Biol Research BACKGROUND: Sites in DNA that bind regulatory proteins can be detected computationally in various ways. Pattern discovery methods analyze collections of genes suspected to be co-regulated on the evidence, for example, of clustering of transcriptome data. Pattern searching methods use sequences with known binding sites to find other genes regulated by a given protein. Such computational methods are important strategies in the discovery and elaboration of regulatory networks and can provide the experimental biologist with a precise prediction of a binding site or identify a gene as a member of a set of co-regulated genes (a regulon). As more variations on such methods are published, however, thorough evaluation is necessary, as performance may differ depending on the conditions of use. Detailed evaluation also helps to improve and understand the behavior of the different methods and computational strategies. RESULTS: We used a collection of 86 regulons from Escherichia coli as datasets to evaluate two methods for pattern discovery and pattern searching: dyad analysis/dyad sweeping using the program Dyad-analysis, and multiple alignment using the programs Consensus/Patser. Clearly defined statistical parameters are used to evaluate the two methods in different situations. We placed particular emphasis on minimizing the rate of false positives. CONCLUSIONS: As a general rule, sensors obtained from experimentally reported binding sites in DNA frequently locate true sites as the highest-scoring sequences within a given upstream region, especially using Consensus/Patser. Pattern discovery is still an unsolved problem, although in the cases where Dyad-analysis finds significant dyads (around 50%), these frequently correspond to true binding sites. With more robust methods, regulatory predictions could help identify the function of unknown genes. BioMed Central 2002 2002-02-21 /pmc/articles/PMC88811/ /pubmed/11897025 Text en Copyright © 2002 Benítez-Bellón et al., licensee BioMed Central Ltd
spellingShingle Research
Benítez-Bellón, Esperanza
Moreno-Hagelsieb, Gabriel
Collado-Vides, Julio
Evaluation of thresholds for the detection of binding sites for regulatory proteins in Escherichia coli K12 DNA
title Evaluation of thresholds for the detection of binding sites for regulatory proteins in Escherichia coli K12 DNA
title_full Evaluation of thresholds for the detection of binding sites for regulatory proteins in Escherichia coli K12 DNA
title_fullStr Evaluation of thresholds for the detection of binding sites for regulatory proteins in Escherichia coli K12 DNA
title_full_unstemmed Evaluation of thresholds for the detection of binding sites for regulatory proteins in Escherichia coli K12 DNA
title_short Evaluation of thresholds for the detection of binding sites for regulatory proteins in Escherichia coli K12 DNA
title_sort evaluation of thresholds for the detection of binding sites for regulatory proteins in escherichia coli k12 dna
topic Research
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC88811/
https://www.ncbi.nlm.nih.gov/pubmed/11897025
work_keys_str_mv AT benitezbellonesperanza evaluationofthresholdsforthedetectionofbindingsitesforregulatoryproteinsinescherichiacolik12dna
AT morenohagelsiebgabriel evaluationofthresholdsforthedetectionofbindingsitesforregulatoryproteinsinescherichiacolik12dna
AT colladovidesjulio evaluationofthresholdsforthedetectionofbindingsitesforregulatoryproteinsinescherichiacolik12dna