Cargando…

MER41 Repeat Sequences Contain Inducible STAT1 Binding Sites

Chromatin immunoprecipitation combined with massively parallel sequencing methods (ChIP-seq) is becoming the standard approach to study interactions of transcription factors (TF) with genomic sequences. At the example of public STAT1 ChIP-seq data sets, we present novel approaches for the interpreta...

Descripción completa

Detalles Bibliográficos
Autores principales: Schmid, Christoph D., Bucher, Philipp
Formato: Texto
Lenguaje:English
Publicado: Public Library of Science 2010
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2897888/
https://www.ncbi.nlm.nih.gov/pubmed/20625510
http://dx.doi.org/10.1371/journal.pone.0011425
_version_ 1782183452429254656
author Schmid, Christoph D.
Bucher, Philipp
author_facet Schmid, Christoph D.
Bucher, Philipp
author_sort Schmid, Christoph D.
collection PubMed
description Chromatin immunoprecipitation combined with massively parallel sequencing methods (ChIP-seq) is becoming the standard approach to study interactions of transcription factors (TF) with genomic sequences. At the example of public STAT1 ChIP-seq data sets, we present novel approaches for the interpretation of ChIP-seq data. We compare recently developed approaches to determine STAT1 binding sites from ChIP-seq data. Assessing the content of the established consensus sequence for STAT1 binding sites, we find that the usage of “negative control” ChIP-seq data fails to provide substantial advantages. We derive a single refined probabilistic model of STAT1 binding sequences from these ChIP-seq data. Contrary to previous claims, we find no evidence that STAT1 binds to multiple distinct motifs upon interferon-gamma stimulation in vivo. While a large majority of genomic sites with high ChIP-seq signal is associated with a nucleotide sequence ressembling a STAT1 binding site, only a very small subset of the over 5 million potential STAT1 binding sites in the human genome is covered by ChIP-seq data. Furthermore a surprisingly large fraction of the ChIP-seq signal (5%) is absorbed by a small family of repetitive sequences (MER41). The observation of the binding of activated STAT1 protein to a specific repetitive element bolsters similar reports concerning p53 and other TFs, and strengthens the notion of an involvement of repeats in gene regulation. Incidentally MER41 are specific to primates, consequently, regulatory mechanisms in the IFN-STAT pathway might fundamentally differ between primates and rodents. On a methodological aspect, the presence of large numbers of nearly identical binding sites in repetitive sequences may lead to wrong conclusions about intrinsic binding preferences of TF as illustrated by the spacing analysis STAT1 tandem motifs. Therefore, ChIP-seq data should be analyzed independently within repetitive and non-repetitive sequences.
format Text
id pubmed-2897888
institution National Center for Biotechnology Information
language English
publishDate 2010
publisher Public Library of Science
record_format MEDLINE/PubMed
spelling pubmed-28978882010-07-12 MER41 Repeat Sequences Contain Inducible STAT1 Binding Sites Schmid, Christoph D. Bucher, Philipp PLoS One Research Article Chromatin immunoprecipitation combined with massively parallel sequencing methods (ChIP-seq) is becoming the standard approach to study interactions of transcription factors (TF) with genomic sequences. At the example of public STAT1 ChIP-seq data sets, we present novel approaches for the interpretation of ChIP-seq data. We compare recently developed approaches to determine STAT1 binding sites from ChIP-seq data. Assessing the content of the established consensus sequence for STAT1 binding sites, we find that the usage of “negative control” ChIP-seq data fails to provide substantial advantages. We derive a single refined probabilistic model of STAT1 binding sequences from these ChIP-seq data. Contrary to previous claims, we find no evidence that STAT1 binds to multiple distinct motifs upon interferon-gamma stimulation in vivo. While a large majority of genomic sites with high ChIP-seq signal is associated with a nucleotide sequence ressembling a STAT1 binding site, only a very small subset of the over 5 million potential STAT1 binding sites in the human genome is covered by ChIP-seq data. Furthermore a surprisingly large fraction of the ChIP-seq signal (5%) is absorbed by a small family of repetitive sequences (MER41). The observation of the binding of activated STAT1 protein to a specific repetitive element bolsters similar reports concerning p53 and other TFs, and strengthens the notion of an involvement of repeats in gene regulation. Incidentally MER41 are specific to primates, consequently, regulatory mechanisms in the IFN-STAT pathway might fundamentally differ between primates and rodents. On a methodological aspect, the presence of large numbers of nearly identical binding sites in repetitive sequences may lead to wrong conclusions about intrinsic binding preferences of TF as illustrated by the spacing analysis STAT1 tandem motifs. Therefore, ChIP-seq data should be analyzed independently within repetitive and non-repetitive sequences. Public Library of Science 2010-07-06 /pmc/articles/PMC2897888/ /pubmed/20625510 http://dx.doi.org/10.1371/journal.pone.0011425 Text en Schmid, Bucher. http://creativecommons.org/licenses/by/4.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are properly credited.
spellingShingle Research Article
Schmid, Christoph D.
Bucher, Philipp
MER41 Repeat Sequences Contain Inducible STAT1 Binding Sites
title MER41 Repeat Sequences Contain Inducible STAT1 Binding Sites
title_full MER41 Repeat Sequences Contain Inducible STAT1 Binding Sites
title_fullStr MER41 Repeat Sequences Contain Inducible STAT1 Binding Sites
title_full_unstemmed MER41 Repeat Sequences Contain Inducible STAT1 Binding Sites
title_short MER41 Repeat Sequences Contain Inducible STAT1 Binding Sites
title_sort mer41 repeat sequences contain inducible stat1 binding sites
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2897888/
https://www.ncbi.nlm.nih.gov/pubmed/20625510
http://dx.doi.org/10.1371/journal.pone.0011425
work_keys_str_mv AT schmidchristophd mer41repeatsequencescontaininduciblestat1bindingsites
AT bucherphilipp mer41repeatsequencescontaininduciblestat1bindingsites