Cargando…

MAPPER: a search engine for the computational identification of putative transcription factor binding sites in multiple genomes

BACKGROUND: Cis-regulatory modules are combinations of regulatory elements occurring in close proximity to each other that control the spatial and temporal expression of genes. The ability to identify them in a genome-wide manner depends on the availability of accurate models and of search methods a...

Descripción completa

Detalles Bibliográficos
Autores principales: Marinescu, Voichita D, Kohane, Isaac S, Riva, Alberto
Formato: Texto
Lenguaje:English
Publicado: BioMed Central 2005
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC1131891/
https://www.ncbi.nlm.nih.gov/pubmed/15799782
http://dx.doi.org/10.1186/1471-2105-6-79
_version_ 1782123952230891520
author Marinescu, Voichita D
Kohane, Isaac S
Riva, Alberto
author_facet Marinescu, Voichita D
Kohane, Isaac S
Riva, Alberto
author_sort Marinescu, Voichita D
collection PubMed
description BACKGROUND: Cis-regulatory modules are combinations of regulatory elements occurring in close proximity to each other that control the spatial and temporal expression of genes. The ability to identify them in a genome-wide manner depends on the availability of accurate models and of search methods able to detect putative regulatory elements with enhanced sensitivity and specificity. RESULTS: We describe the implementation of a search method for putative transcription factor binding sites (TFBSs) based on hidden Markov models built from alignments of known sites. We built 1,079 models of TFBSs using experimentally determined sequence alignments of sites provided by the TRANSFAC and JASPAR databases and used them to scan sequences of the human, mouse, fly, worm and yeast genomes. In several cases tested the method identified correctly experimentally characterized sites, with better specificity and sensitivity than other similar computational methods. Moreover, a large-scale comparison using synthetic data showed that in the majority of cases our method performed significantly better than a nucleotide weight matrix-based method. CONCLUSION: The search engine, available at , allows the identification, visualization and selection of putative TFBSs occurring in the promoter or other regions of a gene from the human, mouse, fly, worm and yeast genomes. In addition it allows the user to upload a sequence to query and to build a model by supplying a multiple sequence alignment of binding sites for a transcription factor of interest. Due to its extensive database of models, powerful search engine and flexible interface, MAPPER represents an effective resource for the large-scale computational analysis of transcriptional regulation.
format Text
id pubmed-1131891
institution National Center for Biotechnology Information
language English
publishDate 2005
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-11318912005-05-20 MAPPER: a search engine for the computational identification of putative transcription factor binding sites in multiple genomes Marinescu, Voichita D Kohane, Isaac S Riva, Alberto BMC Bioinformatics Software BACKGROUND: Cis-regulatory modules are combinations of regulatory elements occurring in close proximity to each other that control the spatial and temporal expression of genes. The ability to identify them in a genome-wide manner depends on the availability of accurate models and of search methods able to detect putative regulatory elements with enhanced sensitivity and specificity. RESULTS: We describe the implementation of a search method for putative transcription factor binding sites (TFBSs) based on hidden Markov models built from alignments of known sites. We built 1,079 models of TFBSs using experimentally determined sequence alignments of sites provided by the TRANSFAC and JASPAR databases and used them to scan sequences of the human, mouse, fly, worm and yeast genomes. In several cases tested the method identified correctly experimentally characterized sites, with better specificity and sensitivity than other similar computational methods. Moreover, a large-scale comparison using synthetic data showed that in the majority of cases our method performed significantly better than a nucleotide weight matrix-based method. CONCLUSION: The search engine, available at , allows the identification, visualization and selection of putative TFBSs occurring in the promoter or other regions of a gene from the human, mouse, fly, worm and yeast genomes. In addition it allows the user to upload a sequence to query and to build a model by supplying a multiple sequence alignment of binding sites for a transcription factor of interest. Due to its extensive database of models, powerful search engine and flexible interface, MAPPER represents an effective resource for the large-scale computational analysis of transcriptional regulation. BioMed Central 2005-03-30 /pmc/articles/PMC1131891/ /pubmed/15799782 http://dx.doi.org/10.1186/1471-2105-6-79 Text en Copyright © 2005 Marinescu et al; licensee BioMed Central Ltd.
spellingShingle Software
Marinescu, Voichita D
Kohane, Isaac S
Riva, Alberto
MAPPER: a search engine for the computational identification of putative transcription factor binding sites in multiple genomes
title MAPPER: a search engine for the computational identification of putative transcription factor binding sites in multiple genomes
title_full MAPPER: a search engine for the computational identification of putative transcription factor binding sites in multiple genomes
title_fullStr MAPPER: a search engine for the computational identification of putative transcription factor binding sites in multiple genomes
title_full_unstemmed MAPPER: a search engine for the computational identification of putative transcription factor binding sites in multiple genomes
title_short MAPPER: a search engine for the computational identification of putative transcription factor binding sites in multiple genomes
title_sort mapper: a search engine for the computational identification of putative transcription factor binding sites in multiple genomes
topic Software
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC1131891/
https://www.ncbi.nlm.nih.gov/pubmed/15799782
http://dx.doi.org/10.1186/1471-2105-6-79
work_keys_str_mv AT marinescuvoichitad mapperasearchengineforthecomputationalidentificationofputativetranscriptionfactorbindingsitesinmultiplegenomes
AT kohaneisaacs mapperasearchengineforthecomputationalidentificationofputativetranscriptionfactorbindingsitesinmultiplegenomes
AT rivaalberto mapperasearchengineforthecomputationalidentificationofputativetranscriptionfactorbindingsitesinmultiplegenomes