Cargando…

Combining signal and sequence to detect RNA polymerase initiation in ATAC-seq data

The assay for transposase-accessible chromatin followed by sequencing (ATAC-seq) is an inexpensive protocol for measuring open chromatin regions. ATAC-seq is also relatively simple and requires fewer cells than many other high-throughput sequencing protocols. Therefore, it is tractable in numerous s...

Descripción completa

Detalles Bibliográficos
Autores principales: Tripodi, Ignacio J., Chowdhury, Murad, Gruca, Margaret, Dowell, Robin D.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Public Library of Science 2020
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7192442/
https://www.ncbi.nlm.nih.gov/pubmed/32353042
http://dx.doi.org/10.1371/journal.pone.0232332
_version_ 1783528009569927168
author Tripodi, Ignacio J.
Chowdhury, Murad
Gruca, Margaret
Dowell, Robin D.
author_facet Tripodi, Ignacio J.
Chowdhury, Murad
Gruca, Margaret
Dowell, Robin D.
author_sort Tripodi, Ignacio J.
collection PubMed
description The assay for transposase-accessible chromatin followed by sequencing (ATAC-seq) is an inexpensive protocol for measuring open chromatin regions. ATAC-seq is also relatively simple and requires fewer cells than many other high-throughput sequencing protocols. Therefore, it is tractable in numerous settings where other high throughput assays are challenging to impossible. Hence it is important to understand the limits of what can be inferred from ATAC-seq data. In this work, we leverage ATAC-seq to predict the presence of nascent transcription. Nascent transcription assays are the current gold standard for identifying regions of active transcription, including markers for functional transcription factor (TF) binding. We combine mapped short reads from ATAC-seq with the underlying peak sequence, to determine regions of active transcription genome-wide. We show that a hybrid signal/sequence representation classified using recurrent neural networks (RNNs) can identify these regions across different cell types.
format Online
Article
Text
id pubmed-7192442
institution National Center for Biotechnology Information
language English
publishDate 2020
publisher Public Library of Science
record_format MEDLINE/PubMed
spelling pubmed-71924422020-05-11 Combining signal and sequence to detect RNA polymerase initiation in ATAC-seq data Tripodi, Ignacio J. Chowdhury, Murad Gruca, Margaret Dowell, Robin D. PLoS One Research Article The assay for transposase-accessible chromatin followed by sequencing (ATAC-seq) is an inexpensive protocol for measuring open chromatin regions. ATAC-seq is also relatively simple and requires fewer cells than many other high-throughput sequencing protocols. Therefore, it is tractable in numerous settings where other high throughput assays are challenging to impossible. Hence it is important to understand the limits of what can be inferred from ATAC-seq data. In this work, we leverage ATAC-seq to predict the presence of nascent transcription. Nascent transcription assays are the current gold standard for identifying regions of active transcription, including markers for functional transcription factor (TF) binding. We combine mapped short reads from ATAC-seq with the underlying peak sequence, to determine regions of active transcription genome-wide. We show that a hybrid signal/sequence representation classified using recurrent neural networks (RNNs) can identify these regions across different cell types. Public Library of Science 2020-04-30 /pmc/articles/PMC7192442/ /pubmed/32353042 http://dx.doi.org/10.1371/journal.pone.0232332 Text en © 2020 Tripodi et al http://creativecommons.org/licenses/by/4.0/ This is an open access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0/) , which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
spellingShingle Research Article
Tripodi, Ignacio J.
Chowdhury, Murad
Gruca, Margaret
Dowell, Robin D.
Combining signal and sequence to detect RNA polymerase initiation in ATAC-seq data
title Combining signal and sequence to detect RNA polymerase initiation in ATAC-seq data
title_full Combining signal and sequence to detect RNA polymerase initiation in ATAC-seq data
title_fullStr Combining signal and sequence to detect RNA polymerase initiation in ATAC-seq data
title_full_unstemmed Combining signal and sequence to detect RNA polymerase initiation in ATAC-seq data
title_short Combining signal and sequence to detect RNA polymerase initiation in ATAC-seq data
title_sort combining signal and sequence to detect rna polymerase initiation in atac-seq data
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7192442/
https://www.ncbi.nlm.nih.gov/pubmed/32353042
http://dx.doi.org/10.1371/journal.pone.0232332
work_keys_str_mv AT tripodiignacioj combiningsignalandsequencetodetectrnapolymeraseinitiationinatacseqdata
AT chowdhurymurad combiningsignalandsequencetodetectrnapolymeraseinitiationinatacseqdata
AT grucamargaret combiningsignalandsequencetodetectrnapolymeraseinitiationinatacseqdata
AT dowellrobind combiningsignalandsequencetodetectrnapolymeraseinitiationinatacseqdata