Cargando…

A synergistic DNA logic predicts genome-wide chromatin accessibility

Enhancers and promoters commonly occur in accessible chromatin characterized by depleted nucleosome contact; however, it is unclear how chromatin accessibility is governed. We show that log-additive cis-acting DNA sequence features can predict chromatin accessibility at high spatial resolution. We d...

Descripción completa

Detalles Bibliográficos
Autores principales: Hashimoto, Tatsunori, Sherwood, Richard I., Kang, Daniel D., Rajagopal, Nisha, Barkal, Amira A., Zeng, Haoyang, Emons, Bart J.M., Srinivasan, Sharanya, Jaakkola, Tommi, Gifford, David K.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Cold Spring Harbor Laboratory Press 2016
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5052050/
https://www.ncbi.nlm.nih.gov/pubmed/27456004
http://dx.doi.org/10.1101/gr.199778.115
_version_ 1782458180590108672
author Hashimoto, Tatsunori
Sherwood, Richard I.
Kang, Daniel D.
Rajagopal, Nisha
Barkal, Amira A.
Zeng, Haoyang
Emons, Bart J.M.
Srinivasan, Sharanya
Jaakkola, Tommi
Gifford, David K.
author_facet Hashimoto, Tatsunori
Sherwood, Richard I.
Kang, Daniel D.
Rajagopal, Nisha
Barkal, Amira A.
Zeng, Haoyang
Emons, Bart J.M.
Srinivasan, Sharanya
Jaakkola, Tommi
Gifford, David K.
author_sort Hashimoto, Tatsunori
collection PubMed
description Enhancers and promoters commonly occur in accessible chromatin characterized by depleted nucleosome contact; however, it is unclear how chromatin accessibility is governed. We show that log-additive cis-acting DNA sequence features can predict chromatin accessibility at high spatial resolution. We develop a new type of high-dimensional machine learning model, the Synergistic Chromatin Model (SCM), which when trained with DNase-seq data for a cell type is capable of predicting expected read counts of genome-wide chromatin accessibility at every base from DNA sequence alone, with the highest accuracy at hypersensitive sites shared across cell types. We confirm that a SCM accurately predicts chromatin accessibility for thousands of synthetic DNA sequences using a novel CRISPR-based method of highly efficient site-specific DNA library integration. SCMs are directly interpretable and reveal that a logic based on local, nonspecific synergistic effects, largely among pioneer TFs, is sufficient to predict a large fraction of cellular chromatin accessibility in a wide variety of cell types.
format Online
Article
Text
id pubmed-5052050
institution National Center for Biotechnology Information
language English
publishDate 2016
publisher Cold Spring Harbor Laboratory Press
record_format MEDLINE/PubMed
spelling pubmed-50520502017-04-01 A synergistic DNA logic predicts genome-wide chromatin accessibility Hashimoto, Tatsunori Sherwood, Richard I. Kang, Daniel D. Rajagopal, Nisha Barkal, Amira A. Zeng, Haoyang Emons, Bart J.M. Srinivasan, Sharanya Jaakkola, Tommi Gifford, David K. Genome Res Method Enhancers and promoters commonly occur in accessible chromatin characterized by depleted nucleosome contact; however, it is unclear how chromatin accessibility is governed. We show that log-additive cis-acting DNA sequence features can predict chromatin accessibility at high spatial resolution. We develop a new type of high-dimensional machine learning model, the Synergistic Chromatin Model (SCM), which when trained with DNase-seq data for a cell type is capable of predicting expected read counts of genome-wide chromatin accessibility at every base from DNA sequence alone, with the highest accuracy at hypersensitive sites shared across cell types. We confirm that a SCM accurately predicts chromatin accessibility for thousands of synthetic DNA sequences using a novel CRISPR-based method of highly efficient site-specific DNA library integration. SCMs are directly interpretable and reveal that a logic based on local, nonspecific synergistic effects, largely among pioneer TFs, is sufficient to predict a large fraction of cellular chromatin accessibility in a wide variety of cell types. Cold Spring Harbor Laboratory Press 2016-10 /pmc/articles/PMC5052050/ /pubmed/27456004 http://dx.doi.org/10.1101/gr.199778.115 Text en © 2016 Hashimoto et al.; Published by Cold Spring Harbor Laboratory Press http://creativecommons.org/licenses/by-nc/4.0/ This article is distributed exclusively by Cold Spring Harbor Laboratory Press for the first six months after the full-issue publication date (see http://genome.cshlp.org/site/misc/terms.xhtml). After six months, it is available under a Creative Commons License (Attribution-NonCommercial 4.0 International), as described at http://creativecommons.org/licenses/by-nc/4.0/.
spellingShingle Method
Hashimoto, Tatsunori
Sherwood, Richard I.
Kang, Daniel D.
Rajagopal, Nisha
Barkal, Amira A.
Zeng, Haoyang
Emons, Bart J.M.
Srinivasan, Sharanya
Jaakkola, Tommi
Gifford, David K.
A synergistic DNA logic predicts genome-wide chromatin accessibility
title A synergistic DNA logic predicts genome-wide chromatin accessibility
title_full A synergistic DNA logic predicts genome-wide chromatin accessibility
title_fullStr A synergistic DNA logic predicts genome-wide chromatin accessibility
title_full_unstemmed A synergistic DNA logic predicts genome-wide chromatin accessibility
title_short A synergistic DNA logic predicts genome-wide chromatin accessibility
title_sort synergistic dna logic predicts genome-wide chromatin accessibility
topic Method
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5052050/
https://www.ncbi.nlm.nih.gov/pubmed/27456004
http://dx.doi.org/10.1101/gr.199778.115
work_keys_str_mv AT hashimototatsunori asynergisticdnalogicpredictsgenomewidechromatinaccessibility
AT sherwoodrichardi asynergisticdnalogicpredictsgenomewidechromatinaccessibility
AT kangdanield asynergisticdnalogicpredictsgenomewidechromatinaccessibility
AT rajagopalnisha asynergisticdnalogicpredictsgenomewidechromatinaccessibility
AT barkalamiraa asynergisticdnalogicpredictsgenomewidechromatinaccessibility
AT zenghaoyang asynergisticdnalogicpredictsgenomewidechromatinaccessibility
AT emonsbartjm asynergisticdnalogicpredictsgenomewidechromatinaccessibility
AT srinivasansharanya asynergisticdnalogicpredictsgenomewidechromatinaccessibility
AT jaakkolatommi asynergisticdnalogicpredictsgenomewidechromatinaccessibility
AT gifforddavidk asynergisticdnalogicpredictsgenomewidechromatinaccessibility
AT hashimototatsunori synergisticdnalogicpredictsgenomewidechromatinaccessibility
AT sherwoodrichardi synergisticdnalogicpredictsgenomewidechromatinaccessibility
AT kangdanield synergisticdnalogicpredictsgenomewidechromatinaccessibility
AT rajagopalnisha synergisticdnalogicpredictsgenomewidechromatinaccessibility
AT barkalamiraa synergisticdnalogicpredictsgenomewidechromatinaccessibility
AT zenghaoyang synergisticdnalogicpredictsgenomewidechromatinaccessibility
AT emonsbartjm synergisticdnalogicpredictsgenomewidechromatinaccessibility
AT srinivasansharanya synergisticdnalogicpredictsgenomewidechromatinaccessibility
AT jaakkolatommi synergisticdnalogicpredictsgenomewidechromatinaccessibility
AT gifforddavidk synergisticdnalogicpredictsgenomewidechromatinaccessibility