Cargando…

ScerTF: a comprehensive database of benchmarked position weight matrices for Saccharomyces species

Saccharomyces cerevisiae is a primary model for studies of transcriptional control, and the specificities of most yeast transcription factors (TFs) have been determined by multiple methods. However, it is unclear which position weight matrices (PWMs) are most useful; for the roughly 200 TFs in yeast...

Descripción completa

Detalles Bibliográficos
Autores principales: Spivak, Aaron T., Stormo, Gary D.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Oxford University Press 2012
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3245033/
https://www.ncbi.nlm.nih.gov/pubmed/22140105
http://dx.doi.org/10.1093/nar/gkr1180
_version_ 1782219784665956352
author Spivak, Aaron T.
Stormo, Gary D.
author_facet Spivak, Aaron T.
Stormo, Gary D.
author_sort Spivak, Aaron T.
collection PubMed
description Saccharomyces cerevisiae is a primary model for studies of transcriptional control, and the specificities of most yeast transcription factors (TFs) have been determined by multiple methods. However, it is unclear which position weight matrices (PWMs) are most useful; for the roughly 200 TFs in yeast, there are over 1200 PWMs in the literature. To address this issue, we created ScerTF, a comprehensive database of 1226 motifs from 11 different sources. We identified a single matrix for each TF that best predicts in vivo data by benchmarking matrices against chromatin immunoprecipitation and TF deletion experiments. We also used in vivo data to optimize thresholds for identifying regulatory sites with each matrix. To correct for biases from different methods, we developed a strategy to combine matrices. These aligned matrices outperform the best available matrix for several TFs. We used the matrices to predict co-occurring regulatory elements in the genome and identified many known TF combinations. In addition, we predict new combinations and provide evidence of combinatorial regulation from gene expression data. The database is available through a web interface at http://ural.wustl.edu/ScerTF. The site allows users to search the database with a regulatory site or matrix to identify the TFs most likely to bind the input sequence.
format Online
Article
Text
id pubmed-3245033
institution National Center for Biotechnology Information
language English
publishDate 2012
publisher Oxford University Press
record_format MEDLINE/PubMed
spelling pubmed-32450332012-01-10 ScerTF: a comprehensive database of benchmarked position weight matrices for Saccharomyces species Spivak, Aaron T. Stormo, Gary D. Nucleic Acids Res Articles Saccharomyces cerevisiae is a primary model for studies of transcriptional control, and the specificities of most yeast transcription factors (TFs) have been determined by multiple methods. However, it is unclear which position weight matrices (PWMs) are most useful; for the roughly 200 TFs in yeast, there are over 1200 PWMs in the literature. To address this issue, we created ScerTF, a comprehensive database of 1226 motifs from 11 different sources. We identified a single matrix for each TF that best predicts in vivo data by benchmarking matrices against chromatin immunoprecipitation and TF deletion experiments. We also used in vivo data to optimize thresholds for identifying regulatory sites with each matrix. To correct for biases from different methods, we developed a strategy to combine matrices. These aligned matrices outperform the best available matrix for several TFs. We used the matrices to predict co-occurring regulatory elements in the genome and identified many known TF combinations. In addition, we predict new combinations and provide evidence of combinatorial regulation from gene expression data. The database is available through a web interface at http://ural.wustl.edu/ScerTF. The site allows users to search the database with a regulatory site or matrix to identify the TFs most likely to bind the input sequence. Oxford University Press 2012-01 2011-12-02 /pmc/articles/PMC3245033/ /pubmed/22140105 http://dx.doi.org/10.1093/nar/gkr1180 Text en © The Author(s) 2011. Published by Oxford University Press. http://creativecommons.org/licenses/by-nc/3.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution Non-Commercial License (http://creativecommons.org/licenses/by-nc/3.0), which permits unrestricted non-commercial use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Articles
Spivak, Aaron T.
Stormo, Gary D.
ScerTF: a comprehensive database of benchmarked position weight matrices for Saccharomyces species
title ScerTF: a comprehensive database of benchmarked position weight matrices for Saccharomyces species
title_full ScerTF: a comprehensive database of benchmarked position weight matrices for Saccharomyces species
title_fullStr ScerTF: a comprehensive database of benchmarked position weight matrices for Saccharomyces species
title_full_unstemmed ScerTF: a comprehensive database of benchmarked position weight matrices for Saccharomyces species
title_short ScerTF: a comprehensive database of benchmarked position weight matrices for Saccharomyces species
title_sort scertf: a comprehensive database of benchmarked position weight matrices for saccharomyces species
topic Articles
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3245033/
https://www.ncbi.nlm.nih.gov/pubmed/22140105
http://dx.doi.org/10.1093/nar/gkr1180
work_keys_str_mv AT spivakaaront scertfacomprehensivedatabaseofbenchmarkedpositionweightmatricesforsaccharomycesspecies
AT stormogaryd scertfacomprehensivedatabaseofbenchmarkedpositionweightmatricesforsaccharomycesspecies