Cargando…

Subtelomeric CTCF and cohesin binding site organization using improved subtelomere assemblies and a novel annotation pipeline

Mapping genome-wide data to human subtelomeres has been problematic due to the incomplete assembly and challenges of low-copy repetitive DNA elements. Here, we provide updated human subtelomere sequence assemblies that were extended by filling telomere-adjacent gaps using clone-based resources. A bi...

Descripción completa

Detalles Bibliográficos
Autores principales: Stong, Nicholas, Deng, Zhong, Gupta, Ravi, Hu, Sufen, Paul, Shiela, Weiner, Amber K., Eichler, Evan E., Graves, Tina, Fronick, Catrina C., Courtney, Laura, Wilson, Richard K., Lieberman, Paul M., Davuluri, Ramana V., Riethman, Harold
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Cold Spring Harbor Laboratory Press 2014
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4032850/
https://www.ncbi.nlm.nih.gov/pubmed/24676094
http://dx.doi.org/10.1101/gr.166983.113
_version_ 1782317709904576512
author Stong, Nicholas
Deng, Zhong
Gupta, Ravi
Hu, Sufen
Paul, Shiela
Weiner, Amber K.
Eichler, Evan E.
Graves, Tina
Fronick, Catrina C.
Courtney, Laura
Wilson, Richard K.
Lieberman, Paul M.
Davuluri, Ramana V.
Riethman, Harold
author_facet Stong, Nicholas
Deng, Zhong
Gupta, Ravi
Hu, Sufen
Paul, Shiela
Weiner, Amber K.
Eichler, Evan E.
Graves, Tina
Fronick, Catrina C.
Courtney, Laura
Wilson, Richard K.
Lieberman, Paul M.
Davuluri, Ramana V.
Riethman, Harold
author_sort Stong, Nicholas
collection PubMed
description Mapping genome-wide data to human subtelomeres has been problematic due to the incomplete assembly and challenges of low-copy repetitive DNA elements. Here, we provide updated human subtelomere sequence assemblies that were extended by filling telomere-adjacent gaps using clone-based resources. A bioinformatic pipeline incorporating multiread mapping for annotation of the updated assemblies using short-read data sets was developed and implemented. Annotation of subtelomeric sequence features as well as mapping of CTCF and cohesin binding sites using ChIP-seq data sets from multiple human cell types confirmed that CTCF and cohesin bind within 3 kb of the start of terminal repeat tracts at many, but not all, subtelomeres. CTCF and cohesin co-occupancy were also enriched near internal telomere-like sequence (ITS) islands and the nonterminal boundaries of subtelomere repeat elements (SREs) in transformed lymphoblastoid cell lines (LCLs) and human embryonic stem cell (ES) lines, but were not significantly enriched in the primary fibroblast IMR90 cell line. Subtelomeric CTCF and cohesin sites predicted by ChIP-seq using our bioinformatics pipeline (but not predicted when only uniquely mapping reads were considered) were consistently validated by ChIP-qPCR. The colocalized CTCF and cohesin sites in SRE regions are candidates for mediating long-range chromatin interactions in the transcript-rich SRE region. A public browser for the integrated display of short-read sequence–based annotations relative to key subtelomere features such as the start of each terminal repeat tract, SRE identity and organization, and subtelomeric gene models was established.
format Online
Article
Text
id pubmed-4032850
institution National Center for Biotechnology Information
language English
publishDate 2014
publisher Cold Spring Harbor Laboratory Press
record_format MEDLINE/PubMed
spelling pubmed-40328502014-12-01 Subtelomeric CTCF and cohesin binding site organization using improved subtelomere assemblies and a novel annotation pipeline Stong, Nicholas Deng, Zhong Gupta, Ravi Hu, Sufen Paul, Shiela Weiner, Amber K. Eichler, Evan E. Graves, Tina Fronick, Catrina C. Courtney, Laura Wilson, Richard K. Lieberman, Paul M. Davuluri, Ramana V. Riethman, Harold Genome Res Resource Mapping genome-wide data to human subtelomeres has been problematic due to the incomplete assembly and challenges of low-copy repetitive DNA elements. Here, we provide updated human subtelomere sequence assemblies that were extended by filling telomere-adjacent gaps using clone-based resources. A bioinformatic pipeline incorporating multiread mapping for annotation of the updated assemblies using short-read data sets was developed and implemented. Annotation of subtelomeric sequence features as well as mapping of CTCF and cohesin binding sites using ChIP-seq data sets from multiple human cell types confirmed that CTCF and cohesin bind within 3 kb of the start of terminal repeat tracts at many, but not all, subtelomeres. CTCF and cohesin co-occupancy were also enriched near internal telomere-like sequence (ITS) islands and the nonterminal boundaries of subtelomere repeat elements (SREs) in transformed lymphoblastoid cell lines (LCLs) and human embryonic stem cell (ES) lines, but were not significantly enriched in the primary fibroblast IMR90 cell line. Subtelomeric CTCF and cohesin sites predicted by ChIP-seq using our bioinformatics pipeline (but not predicted when only uniquely mapping reads were considered) were consistently validated by ChIP-qPCR. The colocalized CTCF and cohesin sites in SRE regions are candidates for mediating long-range chromatin interactions in the transcript-rich SRE region. A public browser for the integrated display of short-read sequence–based annotations relative to key subtelomere features such as the start of each terminal repeat tract, SRE identity and organization, and subtelomeric gene models was established. Cold Spring Harbor Laboratory Press 2014-06 /pmc/articles/PMC4032850/ /pubmed/24676094 http://dx.doi.org/10.1101/gr.166983.113 Text en © 2014 Stong et al.; Published by Cold Spring Harbor Laboratory Press http://creativecommons.org/licenses/by-nc/4.0/ This article is distributed exclusively by Cold Spring Harbor Laboratory Press for the first six months after the full-issue publication date (see http://genome.cshlp.org/site/misc/terms.xhtml). After six months, it is available under a Creative Commons License (Attribution-NonCommercial 4.0 International), as described at http://creativecommons.org/licenses/by-nc/4.0/.
spellingShingle Resource
Stong, Nicholas
Deng, Zhong
Gupta, Ravi
Hu, Sufen
Paul, Shiela
Weiner, Amber K.
Eichler, Evan E.
Graves, Tina
Fronick, Catrina C.
Courtney, Laura
Wilson, Richard K.
Lieberman, Paul M.
Davuluri, Ramana V.
Riethman, Harold
Subtelomeric CTCF and cohesin binding site organization using improved subtelomere assemblies and a novel annotation pipeline
title Subtelomeric CTCF and cohesin binding site organization using improved subtelomere assemblies and a novel annotation pipeline
title_full Subtelomeric CTCF and cohesin binding site organization using improved subtelomere assemblies and a novel annotation pipeline
title_fullStr Subtelomeric CTCF and cohesin binding site organization using improved subtelomere assemblies and a novel annotation pipeline
title_full_unstemmed Subtelomeric CTCF and cohesin binding site organization using improved subtelomere assemblies and a novel annotation pipeline
title_short Subtelomeric CTCF and cohesin binding site organization using improved subtelomere assemblies and a novel annotation pipeline
title_sort subtelomeric ctcf and cohesin binding site organization using improved subtelomere assemblies and a novel annotation pipeline
topic Resource
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4032850/
https://www.ncbi.nlm.nih.gov/pubmed/24676094
http://dx.doi.org/10.1101/gr.166983.113
work_keys_str_mv AT stongnicholas subtelomericctcfandcohesinbindingsiteorganizationusingimprovedsubtelomereassembliesandanovelannotationpipeline
AT dengzhong subtelomericctcfandcohesinbindingsiteorganizationusingimprovedsubtelomereassembliesandanovelannotationpipeline
AT guptaravi subtelomericctcfandcohesinbindingsiteorganizationusingimprovedsubtelomereassembliesandanovelannotationpipeline
AT husufen subtelomericctcfandcohesinbindingsiteorganizationusingimprovedsubtelomereassembliesandanovelannotationpipeline
AT paulshiela subtelomericctcfandcohesinbindingsiteorganizationusingimprovedsubtelomereassembliesandanovelannotationpipeline
AT weineramberk subtelomericctcfandcohesinbindingsiteorganizationusingimprovedsubtelomereassembliesandanovelannotationpipeline
AT eichlerevane subtelomericctcfandcohesinbindingsiteorganizationusingimprovedsubtelomereassembliesandanovelannotationpipeline
AT gravestina subtelomericctcfandcohesinbindingsiteorganizationusingimprovedsubtelomereassembliesandanovelannotationpipeline
AT fronickcatrinac subtelomericctcfandcohesinbindingsiteorganizationusingimprovedsubtelomereassembliesandanovelannotationpipeline
AT courtneylaura subtelomericctcfandcohesinbindingsiteorganizationusingimprovedsubtelomereassembliesandanovelannotationpipeline
AT wilsonrichardk subtelomericctcfandcohesinbindingsiteorganizationusingimprovedsubtelomereassembliesandanovelannotationpipeline
AT liebermanpaulm subtelomericctcfandcohesinbindingsiteorganizationusingimprovedsubtelomereassembliesandanovelannotationpipeline
AT davuluriramanav subtelomericctcfandcohesinbindingsiteorganizationusingimprovedsubtelomereassembliesandanovelannotationpipeline
AT riethmanharold subtelomericctcfandcohesinbindingsiteorganizationusingimprovedsubtelomereassembliesandanovelannotationpipeline