Cargando…

CORE_TF: a user-friendly interface to identify evolutionary conserved transcription factor binding sites in sets of co-regulated genes

BACKGROUND: The identification of transcription factor binding sites is difficult since they are only a small number of nucleotides in size, resulting in large numbers of false positives and false negatives in current approaches. Computational methods to reduce false positives are to look for over-r...

Descripción completa

Detalles Bibliográficos
Autores principales: Hestand, Matthew S, van Galen, Michiel, Villerius, Michel P, van Ommen, Gert-Jan B, den Dunnen, Johan T, 't Hoen, Peter AC
Formato: Texto
Lenguaje:English
Publicado: BioMed Central 2008
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2613159/
https://www.ncbi.nlm.nih.gov/pubmed/19036135
http://dx.doi.org/10.1186/1471-2105-9-495
_version_ 1782163161794740224
author Hestand, Matthew S
van Galen, Michiel
Villerius, Michel P
van Ommen, Gert-Jan B
den Dunnen, Johan T
't Hoen, Peter AC
author_facet Hestand, Matthew S
van Galen, Michiel
Villerius, Michel P
van Ommen, Gert-Jan B
den Dunnen, Johan T
't Hoen, Peter AC
author_sort Hestand, Matthew S
collection PubMed
description BACKGROUND: The identification of transcription factor binding sites is difficult since they are only a small number of nucleotides in size, resulting in large numbers of false positives and false negatives in current approaches. Computational methods to reduce false positives are to look for over-representation of transcription factor binding sites in a set of similarly regulated promoters or to look for conservation in orthologous promoter alignments. RESULTS: We have developed a novel tool, "CORE_TF" (Conserved and Over-REpresented Transcription Factor binding sites) that identifies common transcription factor binding sites in promoters of co-regulated genes. To improve upon existing binding site predictions, the tool searches for position weight matrices from the TRANSFAC(R )database that are over-represented in an experimental set compared to a random set of promoters and identifies cross-species conservation of the predicted transcription factor binding sites. The algorithm has been evaluated with expression and chromatin-immunoprecipitation on microarray data. We also implement and demonstrate the importance of matching the random set of promoters to the experimental promoters by GC content, which is a unique feature of our tool. CONCLUSION: The program CORE_TF is accessible in a user friendly web interface at . It provides a table of over-represented transcription factor binding sites in the users input genes' promoters and a graphical view of evolutionary conserved transcription factor binding sites. In our test data sets it successfully predicts target transcription factors and their binding sites.
format Text
id pubmed-2613159
institution National Center for Biotechnology Information
language English
publishDate 2008
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-26131592009-01-01 CORE_TF: a user-friendly interface to identify evolutionary conserved transcription factor binding sites in sets of co-regulated genes Hestand, Matthew S van Galen, Michiel Villerius, Michel P van Ommen, Gert-Jan B den Dunnen, Johan T 't Hoen, Peter AC BMC Bioinformatics Software BACKGROUND: The identification of transcription factor binding sites is difficult since they are only a small number of nucleotides in size, resulting in large numbers of false positives and false negatives in current approaches. Computational methods to reduce false positives are to look for over-representation of transcription factor binding sites in a set of similarly regulated promoters or to look for conservation in orthologous promoter alignments. RESULTS: We have developed a novel tool, "CORE_TF" (Conserved and Over-REpresented Transcription Factor binding sites) that identifies common transcription factor binding sites in promoters of co-regulated genes. To improve upon existing binding site predictions, the tool searches for position weight matrices from the TRANSFAC(R )database that are over-represented in an experimental set compared to a random set of promoters and identifies cross-species conservation of the predicted transcription factor binding sites. The algorithm has been evaluated with expression and chromatin-immunoprecipitation on microarray data. We also implement and demonstrate the importance of matching the random set of promoters to the experimental promoters by GC content, which is a unique feature of our tool. CONCLUSION: The program CORE_TF is accessible in a user friendly web interface at . It provides a table of over-represented transcription factor binding sites in the users input genes' promoters and a graphical view of evolutionary conserved transcription factor binding sites. In our test data sets it successfully predicts target transcription factors and their binding sites. BioMed Central 2008-11-26 /pmc/articles/PMC2613159/ /pubmed/19036135 http://dx.doi.org/10.1186/1471-2105-9-495 Text en Copyright © 2008 Hestand et al; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution License ( (http://creativecommons.org/licenses/by/2.0) ), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Software
Hestand, Matthew S
van Galen, Michiel
Villerius, Michel P
van Ommen, Gert-Jan B
den Dunnen, Johan T
't Hoen, Peter AC
CORE_TF: a user-friendly interface to identify evolutionary conserved transcription factor binding sites in sets of co-regulated genes
title CORE_TF: a user-friendly interface to identify evolutionary conserved transcription factor binding sites in sets of co-regulated genes
title_full CORE_TF: a user-friendly interface to identify evolutionary conserved transcription factor binding sites in sets of co-regulated genes
title_fullStr CORE_TF: a user-friendly interface to identify evolutionary conserved transcription factor binding sites in sets of co-regulated genes
title_full_unstemmed CORE_TF: a user-friendly interface to identify evolutionary conserved transcription factor binding sites in sets of co-regulated genes
title_short CORE_TF: a user-friendly interface to identify evolutionary conserved transcription factor binding sites in sets of co-regulated genes
title_sort core_tf: a user-friendly interface to identify evolutionary conserved transcription factor binding sites in sets of co-regulated genes
topic Software
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2613159/
https://www.ncbi.nlm.nih.gov/pubmed/19036135
http://dx.doi.org/10.1186/1471-2105-9-495
work_keys_str_mv AT hestandmatthews coretfauserfriendlyinterfacetoidentifyevolutionaryconservedtranscriptionfactorbindingsitesinsetsofcoregulatedgenes
AT vangalenmichiel coretfauserfriendlyinterfacetoidentifyevolutionaryconservedtranscriptionfactorbindingsitesinsetsofcoregulatedgenes
AT villeriusmichelp coretfauserfriendlyinterfacetoidentifyevolutionaryconservedtranscriptionfactorbindingsitesinsetsofcoregulatedgenes
AT vanommengertjanb coretfauserfriendlyinterfacetoidentifyevolutionaryconservedtranscriptionfactorbindingsitesinsetsofcoregulatedgenes
AT dendunnenjohant coretfauserfriendlyinterfacetoidentifyevolutionaryconservedtranscriptionfactorbindingsitesinsetsofcoregulatedgenes
AT thoenpeterac coretfauserfriendlyinterfacetoidentifyevolutionaryconservedtranscriptionfactorbindingsitesinsetsofcoregulatedgenes