Cargando…

Integrating regulatory DNA sequence and gene expression to predict genome-wide chromatin accessibility across cellular contexts

MOTIVATION: Genome-wide profiles of chromatin accessibility and gene expression in diverse cellular contexts are critical to decipher the dynamics of transcriptional regulation. Recently, convolutional neural networks have been used to learn predictive cis-regulatory DNA sequence models of context-s...

Descripción completa

Detalles Bibliográficos
Autores principales: Nair, Surag, Kim, Daniel S, Perricone, Jacob, Kundaje, Anshul
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Oxford University Press 2019
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6612838/
https://www.ncbi.nlm.nih.gov/pubmed/31510655
http://dx.doi.org/10.1093/bioinformatics/btz352
_version_ 1783432948485193728
author Nair, Surag
Kim, Daniel S
Perricone, Jacob
Kundaje, Anshul
author_facet Nair, Surag
Kim, Daniel S
Perricone, Jacob
Kundaje, Anshul
author_sort Nair, Surag
collection PubMed
description MOTIVATION: Genome-wide profiles of chromatin accessibility and gene expression in diverse cellular contexts are critical to decipher the dynamics of transcriptional regulation. Recently, convolutional neural networks have been used to learn predictive cis-regulatory DNA sequence models of context-specific chromatin accessibility landscapes. However, these context-specific regulatory sequence models cannot generalize predictions across cell types. RESULTS: We introduce multi-modal, residual neural network architectures that integrate cis-regulatory sequence and context-specific expression of trans-regulators to predict genome-wide chromatin accessibility profiles across cellular contexts. We show that the average accessibility of a genomic region across training contexts can be a surprisingly powerful predictor. We leverage this feature and employ novel strategies for training models to enhance genome-wide prediction of shared and context-specific chromatin accessible sites across cell types. We interpret the models to reveal insights into cis- and trans-regulation of chromatin dynamics across 123 diverse cellular contexts. AVAILABILITY AND IMPLEMENTATION: The code is available at https://github.com/kundajelab/ChromDragoNN. SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.
format Online
Article
Text
id pubmed-6612838
institution National Center for Biotechnology Information
language English
publishDate 2019
publisher Oxford University Press
record_format MEDLINE/PubMed
spelling pubmed-66128382019-07-12 Integrating regulatory DNA sequence and gene expression to predict genome-wide chromatin accessibility across cellular contexts Nair, Surag Kim, Daniel S Perricone, Jacob Kundaje, Anshul Bioinformatics Ismb/Eccb 2019 Conference Proceedings MOTIVATION: Genome-wide profiles of chromatin accessibility and gene expression in diverse cellular contexts are critical to decipher the dynamics of transcriptional regulation. Recently, convolutional neural networks have been used to learn predictive cis-regulatory DNA sequence models of context-specific chromatin accessibility landscapes. However, these context-specific regulatory sequence models cannot generalize predictions across cell types. RESULTS: We introduce multi-modal, residual neural network architectures that integrate cis-regulatory sequence and context-specific expression of trans-regulators to predict genome-wide chromatin accessibility profiles across cellular contexts. We show that the average accessibility of a genomic region across training contexts can be a surprisingly powerful predictor. We leverage this feature and employ novel strategies for training models to enhance genome-wide prediction of shared and context-specific chromatin accessible sites across cell types. We interpret the models to reveal insights into cis- and trans-regulation of chromatin dynamics across 123 diverse cellular contexts. AVAILABILITY AND IMPLEMENTATION: The code is available at https://github.com/kundajelab/ChromDragoNN. SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online. Oxford University Press 2019-07 2019-07-05 /pmc/articles/PMC6612838/ /pubmed/31510655 http://dx.doi.org/10.1093/bioinformatics/btz352 Text en © The Author(s) 2019. Published by Oxford University Press. http://creativecommons.org/licenses/by-nc/4.0/ This is an Open Access article distributed under the terms of the Creative Commons Attribution Non-Commercial License (http://creativecommons.org/licenses/by-nc/4.0/), which permits non-commercial re-use, distribution, and reproduction in any medium, provided the original work is properly cited. For commercial re-use, please contact journals.permissions@oup.com
spellingShingle Ismb/Eccb 2019 Conference Proceedings
Nair, Surag
Kim, Daniel S
Perricone, Jacob
Kundaje, Anshul
Integrating regulatory DNA sequence and gene expression to predict genome-wide chromatin accessibility across cellular contexts
title Integrating regulatory DNA sequence and gene expression to predict genome-wide chromatin accessibility across cellular contexts
title_full Integrating regulatory DNA sequence and gene expression to predict genome-wide chromatin accessibility across cellular contexts
title_fullStr Integrating regulatory DNA sequence and gene expression to predict genome-wide chromatin accessibility across cellular contexts
title_full_unstemmed Integrating regulatory DNA sequence and gene expression to predict genome-wide chromatin accessibility across cellular contexts
title_short Integrating regulatory DNA sequence and gene expression to predict genome-wide chromatin accessibility across cellular contexts
title_sort integrating regulatory dna sequence and gene expression to predict genome-wide chromatin accessibility across cellular contexts
topic Ismb/Eccb 2019 Conference Proceedings
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6612838/
https://www.ncbi.nlm.nih.gov/pubmed/31510655
http://dx.doi.org/10.1093/bioinformatics/btz352
work_keys_str_mv AT nairsurag integratingregulatorydnasequenceandgeneexpressiontopredictgenomewidechromatinaccessibilityacrosscellularcontexts
AT kimdaniels integratingregulatorydnasequenceandgeneexpressiontopredictgenomewidechromatinaccessibilityacrosscellularcontexts
AT perriconejacob integratingregulatorydnasequenceandgeneexpressiontopredictgenomewidechromatinaccessibilityacrosscellularcontexts
AT kundajeanshul integratingregulatorydnasequenceandgeneexpressiontopredictgenomewidechromatinaccessibilityacrosscellularcontexts