Cargando…

Identification of co-occurring transcription factor binding sites from DNA sequence using clustered position weight matrices

Accurate prediction of transcription factor binding sites (TFBSs) is a prerequisite for identifying cis-regulatory modules that underlie transcriptional regulatory circuits encoded in the genome. Here, we present a computational framework for detecting TFBSs, when multiple position weight matrices (...

Descripción completa

Detalles Bibliográficos
Autores principales: Oh, Young Min, Kim, Jong Kyoung, Choi, Seungjin, Yoo, Joo-Yeon
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Oxford University Press 2012
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3300004/
https://www.ncbi.nlm.nih.gov/pubmed/22187154
http://dx.doi.org/10.1093/nar/gkr1252
_version_ 1782226190351728640
author Oh, Young Min
Kim, Jong Kyoung
Choi, Seungjin
Yoo, Joo-Yeon
author_facet Oh, Young Min
Kim, Jong Kyoung
Choi, Seungjin
Yoo, Joo-Yeon
author_sort Oh, Young Min
collection PubMed
description Accurate prediction of transcription factor binding sites (TFBSs) is a prerequisite for identifying cis-regulatory modules that underlie transcriptional regulatory circuits encoded in the genome. Here, we present a computational framework for detecting TFBSs, when multiple position weight matrices (PWMs) for a transcription factor are available. Grouping multiple PWMs of a transcription factor (TF) based on their sequence similarity improves the specificity of TFBS prediction, which was evaluated using multiple genome-wide ChIP-Seq data sets from 26 TFs. The Z-scores of the area under a receiver operating characteristic curve (AUC) values of 368 TFs were calculated and used to statistically identify co-occurring regulatory motifs in the TF bound ChIP loci. Motifs that are co-occurring along with the empirical bindings of E2F, JUN or MYC have been evaluated, in the basal or stimulated condition. Results prove our method can be useful to systematically identify the co-occurring motifs of the TF for the given conditions.
format Online
Article
Text
id pubmed-3300004
institution National Center for Biotechnology Information
language English
publishDate 2012
publisher Oxford University Press
record_format MEDLINE/PubMed
spelling pubmed-33000042012-03-13 Identification of co-occurring transcription factor binding sites from DNA sequence using clustered position weight matrices Oh, Young Min Kim, Jong Kyoung Choi, Seungjin Yoo, Joo-Yeon Nucleic Acids Res Methods Online Accurate prediction of transcription factor binding sites (TFBSs) is a prerequisite for identifying cis-regulatory modules that underlie transcriptional regulatory circuits encoded in the genome. Here, we present a computational framework for detecting TFBSs, when multiple position weight matrices (PWMs) for a transcription factor are available. Grouping multiple PWMs of a transcription factor (TF) based on their sequence similarity improves the specificity of TFBS prediction, which was evaluated using multiple genome-wide ChIP-Seq data sets from 26 TFs. The Z-scores of the area under a receiver operating characteristic curve (AUC) values of 368 TFs were calculated and used to statistically identify co-occurring regulatory motifs in the TF bound ChIP loci. Motifs that are co-occurring along with the empirical bindings of E2F, JUN or MYC have been evaluated, in the basal or stimulated condition. Results prove our method can be useful to systematically identify the co-occurring motifs of the TF for the given conditions. Oxford University Press 2012-03 2011-12-19 /pmc/articles/PMC3300004/ /pubmed/22187154 http://dx.doi.org/10.1093/nar/gkr1252 Text en © The Author(s) 2011. Published by Oxford University Press. http://creativecommons.org/licenses/by-nc/3.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution Non-Commercial License (http://creativecommons.org/licenses/by-nc/3.0), which permits unrestricted non-commercial use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Methods Online
Oh, Young Min
Kim, Jong Kyoung
Choi, Seungjin
Yoo, Joo-Yeon
Identification of co-occurring transcription factor binding sites from DNA sequence using clustered position weight matrices
title Identification of co-occurring transcription factor binding sites from DNA sequence using clustered position weight matrices
title_full Identification of co-occurring transcription factor binding sites from DNA sequence using clustered position weight matrices
title_fullStr Identification of co-occurring transcription factor binding sites from DNA sequence using clustered position weight matrices
title_full_unstemmed Identification of co-occurring transcription factor binding sites from DNA sequence using clustered position weight matrices
title_short Identification of co-occurring transcription factor binding sites from DNA sequence using clustered position weight matrices
title_sort identification of co-occurring transcription factor binding sites from dna sequence using clustered position weight matrices
topic Methods Online
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3300004/
https://www.ncbi.nlm.nih.gov/pubmed/22187154
http://dx.doi.org/10.1093/nar/gkr1252
work_keys_str_mv AT ohyoungmin identificationofcooccurringtranscriptionfactorbindingsitesfromdnasequenceusingclusteredpositionweightmatrices
AT kimjongkyoung identificationofcooccurringtranscriptionfactorbindingsitesfromdnasequenceusingclusteredpositionweightmatrices
AT choiseungjin identificationofcooccurringtranscriptionfactorbindingsitesfromdnasequenceusingclusteredpositionweightmatrices
AT yoojooyeon identificationofcooccurringtranscriptionfactorbindingsitesfromdnasequenceusingclusteredpositionweightmatrices