Cargando…

Discovering epistatic feature interactions from neural network models of regulatory DNA sequences

MOTIVATION: Transcription factors bind regulatory DNA sequences in a combinatorial manner to modulate gene expression. Deep neural networks (DNNs) can learn the cis-regulatory grammars encoded in regulatory DNA sequences associated with transcription factor binding and chromatin accessibility. Sever...

Descripción completa

Detalles Bibliográficos
Autores principales: Greenside, Peyton, Shimko, Tyler, Fordyce, Polly, Kundaje, Anshul
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Oxford University Press 2018
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6129272/
https://www.ncbi.nlm.nih.gov/pubmed/30423062
http://dx.doi.org/10.1093/bioinformatics/bty575
_version_ 1783353771482415104
author Greenside, Peyton
Shimko, Tyler
Fordyce, Polly
Kundaje, Anshul
author_facet Greenside, Peyton
Shimko, Tyler
Fordyce, Polly
Kundaje, Anshul
author_sort Greenside, Peyton
collection PubMed
description MOTIVATION: Transcription factors bind regulatory DNA sequences in a combinatorial manner to modulate gene expression. Deep neural networks (DNNs) can learn the cis-regulatory grammars encoded in regulatory DNA sequences associated with transcription factor binding and chromatin accessibility. Several feature attribution methods have been developed for estimating the predictive importance of individual features (nucleotides or motifs) in any input DNA sequence to its associated output prediction from a DNN model. However, these methods do not reveal higher-order feature interactions encoded by the models. RESULTS: We present a new method called Deep Feature Interaction Maps (DFIM) to efficiently estimate interactions between all pairs of features in any input DNA sequence. DFIM accurately identifies ground truth motif interactions embedded in simulated regulatory DNA sequences. DFIM identifies synergistic interactions between GATA1 and TAL1 motifs from in vivo TF binding models. DFIM reveals epistatic interactions involving nucleotides flanking the core motif of the Cbf1 TF in yeast from in vitro TF binding models. We also apply DFIM to regulatory sequence models of in vivo chromatin accessibility to reveal interactions between regulatory genetic variants and proximal motifs of target TFs as validated by TF binding quantitative trait loci. Our approach makes significant strides in improving the interpretability of deep learning models for genomics. AVAILABILITY AND IMPLEMENTATION: Code is available at: https://github.com/kundajelab/dfim. SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.
format Online
Article
Text
id pubmed-6129272
institution National Center for Biotechnology Information
language English
publishDate 2018
publisher Oxford University Press
record_format MEDLINE/PubMed
spelling pubmed-61292722018-09-12 Discovering epistatic feature interactions from neural network models of regulatory DNA sequences Greenside, Peyton Shimko, Tyler Fordyce, Polly Kundaje, Anshul Bioinformatics Eccb 2018: European Conference on Computational Biology Proceedings MOTIVATION: Transcription factors bind regulatory DNA sequences in a combinatorial manner to modulate gene expression. Deep neural networks (DNNs) can learn the cis-regulatory grammars encoded in regulatory DNA sequences associated with transcription factor binding and chromatin accessibility. Several feature attribution methods have been developed for estimating the predictive importance of individual features (nucleotides or motifs) in any input DNA sequence to its associated output prediction from a DNN model. However, these methods do not reveal higher-order feature interactions encoded by the models. RESULTS: We present a new method called Deep Feature Interaction Maps (DFIM) to efficiently estimate interactions between all pairs of features in any input DNA sequence. DFIM accurately identifies ground truth motif interactions embedded in simulated regulatory DNA sequences. DFIM identifies synergistic interactions between GATA1 and TAL1 motifs from in vivo TF binding models. DFIM reveals epistatic interactions involving nucleotides flanking the core motif of the Cbf1 TF in yeast from in vitro TF binding models. We also apply DFIM to regulatory sequence models of in vivo chromatin accessibility to reveal interactions between regulatory genetic variants and proximal motifs of target TFs as validated by TF binding quantitative trait loci. Our approach makes significant strides in improving the interpretability of deep learning models for genomics. AVAILABILITY AND IMPLEMENTATION: Code is available at: https://github.com/kundajelab/dfim. SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online. Oxford University Press 2018-09-01 2018-09-08 /pmc/articles/PMC6129272/ /pubmed/30423062 http://dx.doi.org/10.1093/bioinformatics/bty575 Text en © The Author(s) 2018. Published by Oxford University Press. http://creativecommons.org/licenses/by/4.0/ This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted reuse, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Eccb 2018: European Conference on Computational Biology Proceedings
Greenside, Peyton
Shimko, Tyler
Fordyce, Polly
Kundaje, Anshul
Discovering epistatic feature interactions from neural network models of regulatory DNA sequences
title Discovering epistatic feature interactions from neural network models of regulatory DNA sequences
title_full Discovering epistatic feature interactions from neural network models of regulatory DNA sequences
title_fullStr Discovering epistatic feature interactions from neural network models of regulatory DNA sequences
title_full_unstemmed Discovering epistatic feature interactions from neural network models of regulatory DNA sequences
title_short Discovering epistatic feature interactions from neural network models of regulatory DNA sequences
title_sort discovering epistatic feature interactions from neural network models of regulatory dna sequences
topic Eccb 2018: European Conference on Computational Biology Proceedings
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6129272/
https://www.ncbi.nlm.nih.gov/pubmed/30423062
http://dx.doi.org/10.1093/bioinformatics/bty575
work_keys_str_mv AT greensidepeyton discoveringepistaticfeatureinteractionsfromneuralnetworkmodelsofregulatorydnasequences
AT shimkotyler discoveringepistaticfeatureinteractionsfromneuralnetworkmodelsofregulatorydnasequences
AT fordycepolly discoveringepistaticfeatureinteractionsfromneuralnetworkmodelsofregulatorydnasequences
AT kundajeanshul discoveringepistaticfeatureinteractionsfromneuralnetworkmodelsofregulatorydnasequences