Cargando…

NeuronMotif: Deciphering cis-regulatory codes by layer-wise demixing of deep neural networks

Discovering DNA regulatory sequence motifs and their relative positions is vital to understanding the mechanisms of gene expression regulation. Although deep convolutional neural networks (CNNs) have achieved great success in predicting cis-regulatory elements, the discovery of motifs and their comb...

Descripción completa

Detalles Bibliográficos
Autores principales:	Wei, Zheng, Hua, Kui, Wei, Lei, Ma, Shining, Jiang, Rui, Zhang, Xuegong, Li, Yanda, Wong, Wing H., Wang, Xiaowo
Formato:	Online Artículo Texto
Lenguaje:	English
Publicado:	National Academy of Sciences 2023
Materias:	Biological Sciences
Acceso en línea:	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10104575/ https://www.ncbi.nlm.nih.gov/pubmed/37023129 http://dx.doi.org/10.1073/pnas.2216698120

_version_	1785026068120338432
author	Wei, Zheng Hua, Kui Wei, Lei Ma, Shining Jiang, Rui Zhang, Xuegong Li, Yanda Wong, Wing H. Wang, Xiaowo
author_facet	Wei, Zheng Hua, Kui Wei, Lei Ma, Shining Jiang, Rui Zhang, Xuegong Li, Yanda Wong, Wing H. Wang, Xiaowo
author_sort	Wei, Zheng
collection	PubMed
description	Discovering DNA regulatory sequence motifs and their relative positions is vital to understanding the mechanisms of gene expression regulation. Although deep convolutional neural networks (CNNs) have achieved great success in predicting cis-regulatory elements, the discovery of motifs and their combinatorial patterns from these CNN models has remained difficult. We show that the main difficulty is due to the problem of multifaceted neurons which respond to multiple types of sequence patterns. Since existing interpretation methods were mainly designed to visualize the class of sequences that can activate the neuron, the resulting visualization will correspond to a mixture of patterns. Such a mixture is usually difficult to interpret without resolving the mixed patterns. We propose the NeuronMotif algorithm to interpret such neurons. Given any convolutional neuron (CN) in the network, NeuronMotif first generates a large sample of sequences capable of activating the CN, which typically consists of a mixture of patterns. Then, the sequences are “demixed” in a layer-wise manner by backward clustering of the feature maps of the involved convolutional layers. NeuronMotif can output the sequence motifs, and the syntax rules governing their combinations are depicted by position weight matrices organized in tree structures. Compared to existing methods, the motifs found by NeuronMotif have more matches to known motifs in the JASPAR database. The higher-order patterns uncovered for deep CNs are supported by the literature and ATAC-seq footprinting. Overall, NeuronMotif enables the deciphering of cis-regulatory codes from deep CNs and enhances the utility of CNN in genome interpretation.
format	Online Article Text
id	pubmed-10104575
institution	National Center for Biotechnology Information
language	English
publishDate	2023
publisher	National Academy of Sciences
record_format	MEDLINE/PubMed
spelling	pubmed-101045752023-04-15 NeuronMotif: Deciphering cis-regulatory codes by layer-wise demixing of deep neural networks Wei, Zheng Hua, Kui Wei, Lei Ma, Shining Jiang, Rui Zhang, Xuegong Li, Yanda Wong, Wing H. Wang, Xiaowo Proc Natl Acad Sci U S A Biological Sciences Discovering DNA regulatory sequence motifs and their relative positions is vital to understanding the mechanisms of gene expression regulation. Although deep convolutional neural networks (CNNs) have achieved great success in predicting cis-regulatory elements, the discovery of motifs and their combinatorial patterns from these CNN models has remained difficult. We show that the main difficulty is due to the problem of multifaceted neurons which respond to multiple types of sequence patterns. Since existing interpretation methods were mainly designed to visualize the class of sequences that can activate the neuron, the resulting visualization will correspond to a mixture of patterns. Such a mixture is usually difficult to interpret without resolving the mixed patterns. We propose the NeuronMotif algorithm to interpret such neurons. Given any convolutional neuron (CN) in the network, NeuronMotif first generates a large sample of sequences capable of activating the CN, which typically consists of a mixture of patterns. Then, the sequences are “demixed” in a layer-wise manner by backward clustering of the feature maps of the involved convolutional layers. NeuronMotif can output the sequence motifs, and the syntax rules governing their combinations are depicted by position weight matrices organized in tree structures. Compared to existing methods, the motifs found by NeuronMotif have more matches to known motifs in the JASPAR database. The higher-order patterns uncovered for deep CNs are supported by the literature and ATAC-seq footprinting. Overall, NeuronMotif enables the deciphering of cis-regulatory codes from deep CNs and enhances the utility of CNN in genome interpretation. National Academy of Sciences 2023-04-06 2023-04-11 /pmc/articles/PMC10104575/ /pubmed/37023129 http://dx.doi.org/10.1073/pnas.2216698120 Text en Copyright © 2023 the Author(s). Published by PNAS. https://creativecommons.org/licenses/by-nc-nd/4.0/This open access article is distributed under Creative Commons Attribution-NonCommercial-NoDerivatives License 4.0 (CC BY-NC-ND) (https://creativecommons.org/licenses/by-nc-nd/4.0/) .
spellingShingle	Biological Sciences Wei, Zheng Hua, Kui Wei, Lei Ma, Shining Jiang, Rui Zhang, Xuegong Li, Yanda Wong, Wing H. Wang, Xiaowo NeuronMotif: Deciphering cis-regulatory codes by layer-wise demixing of deep neural networks
title	NeuronMotif: Deciphering cis-regulatory codes by layer-wise demixing of deep neural networks
title_full	NeuronMotif: Deciphering cis-regulatory codes by layer-wise demixing of deep neural networks
title_fullStr	NeuronMotif: Deciphering cis-regulatory codes by layer-wise demixing of deep neural networks
title_full_unstemmed	NeuronMotif: Deciphering cis-regulatory codes by layer-wise demixing of deep neural networks
title_short	NeuronMotif: Deciphering cis-regulatory codes by layer-wise demixing of deep neural networks
title_sort	neuronmotif: deciphering cis-regulatory codes by layer-wise demixing of deep neural networks
topic	Biological Sciences
url	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10104575/ https://www.ncbi.nlm.nih.gov/pubmed/37023129 http://dx.doi.org/10.1073/pnas.2216698120
work_keys_str_mv	AT weizheng neuronmotifdecipheringcisregulatorycodesbylayerwisedemixingofdeepneuralnetworks AT huakui neuronmotifdecipheringcisregulatorycodesbylayerwisedemixingofdeepneuralnetworks AT weilei neuronmotifdecipheringcisregulatorycodesbylayerwisedemixingofdeepneuralnetworks AT mashining neuronmotifdecipheringcisregulatorycodesbylayerwisedemixingofdeepneuralnetworks AT jiangrui neuronmotifdecipheringcisregulatorycodesbylayerwisedemixingofdeepneuralnetworks AT zhangxuegong neuronmotifdecipheringcisregulatorycodesbylayerwisedemixingofdeepneuralnetworks AT liyanda neuronmotifdecipheringcisregulatorycodesbylayerwisedemixingofdeepneuralnetworks AT wongwingh neuronmotifdecipheringcisregulatorycodesbylayerwisedemixingofdeepneuralnetworks AT wangxiaowo neuronmotifdecipheringcisregulatorycodesbylayerwisedemixingofdeepneuralnetworks

NeuronMotif: Deciphering cis-regulatory codes by layer-wise demixing of deep neural networks

Ejemplares similares