Cargando…
Identification of co-occurring transcription factor binding sites from DNA sequence using clustered position weight matrices
Accurate prediction of transcription factor binding sites (TFBSs) is a prerequisite for identifying cis-regulatory modules that underlie transcriptional regulatory circuits encoded in the genome. Here, we present a computational framework for detecting TFBSs, when multiple position weight matrices (...
Autores principales: | , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Oxford University Press
2012
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3300004/ https://www.ncbi.nlm.nih.gov/pubmed/22187154 http://dx.doi.org/10.1093/nar/gkr1252 |
_version_ | 1782226190351728640 |
---|---|
author | Oh, Young Min Kim, Jong Kyoung Choi, Seungjin Yoo, Joo-Yeon |
author_facet | Oh, Young Min Kim, Jong Kyoung Choi, Seungjin Yoo, Joo-Yeon |
author_sort | Oh, Young Min |
collection | PubMed |
description | Accurate prediction of transcription factor binding sites (TFBSs) is a prerequisite for identifying cis-regulatory modules that underlie transcriptional regulatory circuits encoded in the genome. Here, we present a computational framework for detecting TFBSs, when multiple position weight matrices (PWMs) for a transcription factor are available. Grouping multiple PWMs of a transcription factor (TF) based on their sequence similarity improves the specificity of TFBS prediction, which was evaluated using multiple genome-wide ChIP-Seq data sets from 26 TFs. The Z-scores of the area under a receiver operating characteristic curve (AUC) values of 368 TFs were calculated and used to statistically identify co-occurring regulatory motifs in the TF bound ChIP loci. Motifs that are co-occurring along with the empirical bindings of E2F, JUN or MYC have been evaluated, in the basal or stimulated condition. Results prove our method can be useful to systematically identify the co-occurring motifs of the TF for the given conditions. |
format | Online Article Text |
id | pubmed-3300004 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2012 |
publisher | Oxford University Press |
record_format | MEDLINE/PubMed |
spelling | pubmed-33000042012-03-13 Identification of co-occurring transcription factor binding sites from DNA sequence using clustered position weight matrices Oh, Young Min Kim, Jong Kyoung Choi, Seungjin Yoo, Joo-Yeon Nucleic Acids Res Methods Online Accurate prediction of transcription factor binding sites (TFBSs) is a prerequisite for identifying cis-regulatory modules that underlie transcriptional regulatory circuits encoded in the genome. Here, we present a computational framework for detecting TFBSs, when multiple position weight matrices (PWMs) for a transcription factor are available. Grouping multiple PWMs of a transcription factor (TF) based on their sequence similarity improves the specificity of TFBS prediction, which was evaluated using multiple genome-wide ChIP-Seq data sets from 26 TFs. The Z-scores of the area under a receiver operating characteristic curve (AUC) values of 368 TFs were calculated and used to statistically identify co-occurring regulatory motifs in the TF bound ChIP loci. Motifs that are co-occurring along with the empirical bindings of E2F, JUN or MYC have been evaluated, in the basal or stimulated condition. Results prove our method can be useful to systematically identify the co-occurring motifs of the TF for the given conditions. Oxford University Press 2012-03 2011-12-19 /pmc/articles/PMC3300004/ /pubmed/22187154 http://dx.doi.org/10.1093/nar/gkr1252 Text en © The Author(s) 2011. Published by Oxford University Press. http://creativecommons.org/licenses/by-nc/3.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution Non-Commercial License (http://creativecommons.org/licenses/by-nc/3.0), which permits unrestricted non-commercial use, distribution, and reproduction in any medium, provided the original work is properly cited. |
spellingShingle | Methods Online Oh, Young Min Kim, Jong Kyoung Choi, Seungjin Yoo, Joo-Yeon Identification of co-occurring transcription factor binding sites from DNA sequence using clustered position weight matrices |
title | Identification of co-occurring transcription factor binding sites from DNA sequence using clustered position weight matrices |
title_full | Identification of co-occurring transcription factor binding sites from DNA sequence using clustered position weight matrices |
title_fullStr | Identification of co-occurring transcription factor binding sites from DNA sequence using clustered position weight matrices |
title_full_unstemmed | Identification of co-occurring transcription factor binding sites from DNA sequence using clustered position weight matrices |
title_short | Identification of co-occurring transcription factor binding sites from DNA sequence using clustered position weight matrices |
title_sort | identification of co-occurring transcription factor binding sites from dna sequence using clustered position weight matrices |
topic | Methods Online |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3300004/ https://www.ncbi.nlm.nih.gov/pubmed/22187154 http://dx.doi.org/10.1093/nar/gkr1252 |
work_keys_str_mv | AT ohyoungmin identificationofcooccurringtranscriptionfactorbindingsitesfromdnasequenceusingclusteredpositionweightmatrices AT kimjongkyoung identificationofcooccurringtranscriptionfactorbindingsitesfromdnasequenceusingclusteredpositionweightmatrices AT choiseungjin identificationofcooccurringtranscriptionfactorbindingsitesfromdnasequenceusingclusteredpositionweightmatrices AT yoojooyeon identificationofcooccurringtranscriptionfactorbindingsitesfromdnasequenceusingclusteredpositionweightmatrices |