Cargando…

HOT or not: examining the basis of high-occupancy target regions

High-occupancy target (HOT) regions are segments of the genome with unusually high number of transcription factor binding sites. These regions are observed in multiple species and thought to have biological importance due to high transcription factor occupancy. Furthermore, they coincide with house-...

Descripción completa

Detalles Bibliográficos
Autores principales: Wreczycka, Katarzyna, Franke, Vedran, Uyar, Bora, Wurmus, Ricardo, Bulut, Selman, Tursun, Baris, Akalin, Altuna
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Oxford University Press 2019
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6582337/
https://www.ncbi.nlm.nih.gov/pubmed/31114922
http://dx.doi.org/10.1093/nar/gkz460
_version_ 1783428303111061504
author Wreczycka, Katarzyna
Franke, Vedran
Uyar, Bora
Wurmus, Ricardo
Bulut, Selman
Tursun, Baris
Akalin, Altuna
author_facet Wreczycka, Katarzyna
Franke, Vedran
Uyar, Bora
Wurmus, Ricardo
Bulut, Selman
Tursun, Baris
Akalin, Altuna
author_sort Wreczycka, Katarzyna
collection PubMed
description High-occupancy target (HOT) regions are segments of the genome with unusually high number of transcription factor binding sites. These regions are observed in multiple species and thought to have biological importance due to high transcription factor occupancy. Furthermore, they coincide with house-keeping gene promoters and consequently associated genes are stably expressed across multiple cell types. Despite these features, HOT regions are solely defined using ChIP-seq experiments and shown to lack canonical motifs for transcription factors that are thought to be bound there. Although, ChIP-seq experiments are the golden standard for finding genome-wide binding sites of a protein, they are not noise free. Here, we show that HOT regions are likely to be ChIP-seq artifacts and they are similar to previously proposed ‘hyper-ChIPable’ regions. Using ChIP-seq data sets for knocked-out transcription factors, we demonstrate presence of false positive signals on HOT regions. We observe sequence characteristics and genomic features that are discriminatory of HOT regions, such as GC/CpG-rich k-mers, enrichment of RNA–DNA hybrids (R-loops) and DNA tertiary structures (G-quadruplex DNA). The artificial ChIP-seq enrichment on HOT regions could be associated to these discriminatory features. Furthermore, we propose strategies to deal with such artifacts for the future ChIP-seq studies.
format Online
Article
Text
id pubmed-6582337
institution National Center for Biotechnology Information
language English
publishDate 2019
publisher Oxford University Press
record_format MEDLINE/PubMed
spelling pubmed-65823372019-06-21 HOT or not: examining the basis of high-occupancy target regions Wreczycka, Katarzyna Franke, Vedran Uyar, Bora Wurmus, Ricardo Bulut, Selman Tursun, Baris Akalin, Altuna Nucleic Acids Res Genomics High-occupancy target (HOT) regions are segments of the genome with unusually high number of transcription factor binding sites. These regions are observed in multiple species and thought to have biological importance due to high transcription factor occupancy. Furthermore, they coincide with house-keeping gene promoters and consequently associated genes are stably expressed across multiple cell types. Despite these features, HOT regions are solely defined using ChIP-seq experiments and shown to lack canonical motifs for transcription factors that are thought to be bound there. Although, ChIP-seq experiments are the golden standard for finding genome-wide binding sites of a protein, they are not noise free. Here, we show that HOT regions are likely to be ChIP-seq artifacts and they are similar to previously proposed ‘hyper-ChIPable’ regions. Using ChIP-seq data sets for knocked-out transcription factors, we demonstrate presence of false positive signals on HOT regions. We observe sequence characteristics and genomic features that are discriminatory of HOT regions, such as GC/CpG-rich k-mers, enrichment of RNA–DNA hybrids (R-loops) and DNA tertiary structures (G-quadruplex DNA). The artificial ChIP-seq enrichment on HOT regions could be associated to these discriminatory features. Furthermore, we propose strategies to deal with such artifacts for the future ChIP-seq studies. Oxford University Press 2019-06-20 2019-05-22 /pmc/articles/PMC6582337/ /pubmed/31114922 http://dx.doi.org/10.1093/nar/gkz460 Text en © The Author(s) 2019. Published by Oxford University Press on behalf of Nucleic Acids Research. http://creativecommons.org/licenses/by/4.0/ This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted reuse, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Genomics
Wreczycka, Katarzyna
Franke, Vedran
Uyar, Bora
Wurmus, Ricardo
Bulut, Selman
Tursun, Baris
Akalin, Altuna
HOT or not: examining the basis of high-occupancy target regions
title HOT or not: examining the basis of high-occupancy target regions
title_full HOT or not: examining the basis of high-occupancy target regions
title_fullStr HOT or not: examining the basis of high-occupancy target regions
title_full_unstemmed HOT or not: examining the basis of high-occupancy target regions
title_short HOT or not: examining the basis of high-occupancy target regions
title_sort hot or not: examining the basis of high-occupancy target regions
topic Genomics
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6582337/
https://www.ncbi.nlm.nih.gov/pubmed/31114922
http://dx.doi.org/10.1093/nar/gkz460
work_keys_str_mv AT wreczyckakatarzyna hotornotexaminingthebasisofhighoccupancytargetregions
AT frankevedran hotornotexaminingthebasisofhighoccupancytargetregions
AT uyarbora hotornotexaminingthebasisofhighoccupancytargetregions
AT wurmusricardo hotornotexaminingthebasisofhighoccupancytargetregions
AT bulutselman hotornotexaminingthebasisofhighoccupancytargetregions
AT tursunbaris hotornotexaminingthebasisofhighoccupancytargetregions
AT akalinaltuna hotornotexaminingthebasisofhighoccupancytargetregions