
Deepprune: Learning Efficient and Interpretable Convolutional Networks Through Weight Pruning for Predicting DNA-Protein Binding

Convolutional neural network (CNN) based methods have outperformed conventional machine learning methods in predicting the binding preference of DNA-protein binding. Although studies in the past have shown that more convolutional kernels help to achieve better performance, visualization of the model...


Bibliographic Details
Main authors: Luo, Xiao; Chi, Weilai; Deng, Minghua
Format: Online Article Text
Language: English
Published: Frontiers Media S.A. 2019
Subjects: Genetics
Online access: https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6879555/
https://www.ncbi.nlm.nih.gov/pubmed/31824562
http://dx.doi.org/10.3389/fgene.2019.01145
author Luo, Xiao
Chi, Weilai
Deng, Minghua
collection PubMed
description Convolutional neural network (CNN) based methods have outperformed conventional machine learning methods in predicting the binding preference of DNA-protein binding. Although studies in the past have shown that more convolutional kernels help to achieve better performance, visualization of the model can be obscured by the use of many kernels, resulting in overfitting and reduced interpretation because the number of motifs in true models is limited. Therefore, we aim to arrive at high performance, but with limited kernel numbers, in CNN-based models for motif inference. We herein present Deepprune, a novel deep learning framework, which prunes the weights in the dense layer and fine-tunes iteratively. These two steps enable the training of CNN-based models with limited kernel numbers, allowing easy interpretation of the learned model. We demonstrate that Deepprune significantly improves motif inference performance for the simulated datasets. Furthermore, we show that Deepprune outperforms the baseline with limited kernel numbers when inferring DNA-binding sites from ChIP-seq data.
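The prune-then-fine-tune loop mentioned in the description can be illustrated with a generic magnitude-pruning sketch. This is not the authors' implementation: the abstract only states that dense-layer weights are pruned and the model is fine-tuned iteratively. The pruning criterion (smallest absolute value), the `fraction` and `rounds` parameters, and the `fine_tune` callback are all assumptions introduced here for illustration.

```python
from typing import Callable, List, Tuple

Matrix = List[List[float]]


def magnitude_mask(weights: Matrix, mask: Matrix, fraction: float) -> Matrix:
    """Return a new mask that also drops the smallest surviving weights.

    `fraction` is the share of currently unpruned weights to remove
    (an assumed criterion, not taken from the paper).
    """
    alive = sorted(
        (abs(w), i, j)
        for i, row in enumerate(weights)
        for j, w in enumerate(row)
        if mask[i][j] == 1.0
    )
    n_drop = int(len(alive) * fraction)
    new_mask = [row[:] for row in mask]
    for _, i, j in alive[:n_drop]:
        new_mask[i][j] = 0.0
    return new_mask


def apply_mask(weights: Matrix, mask: Matrix) -> Matrix:
    """Zero out every weight whose mask entry is 0."""
    return [[w * m for w, m in zip(wr, mr)] for wr, mr in zip(weights, mask)]


def prune_and_finetune(
    weights: Matrix,
    fine_tune: Callable[[Matrix], Matrix],
    fraction: float = 0.5,
    rounds: int = 2,
) -> Tuple[Matrix, Matrix]:
    """Alternate pruning and fine-tuning for a fixed number of rounds."""
    mask = [[1.0] * len(row) for row in weights]
    for _ in range(rounds):
        mask = magnitude_mask(weights, mask, fraction)
        weights = apply_mask(weights, mask)
        # `fine_tune` is a hypothetical stand-in for retraining the network;
        # the mask is reapplied so pruned weights stay at zero afterwards.
        weights = apply_mask(fine_tune(weights), mask)
    return weights, mask
```

In a real model, `fine_tune` would run a few training epochs on the dense layer while the convolutional kernels stay few enough to inspect as motifs; here any function that maps a weight matrix to a weight matrix of the same shape will do.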
format Online
Article
Text
id pubmed-6879555
institution National Center for Biotechnology Information
language English
publishDate 2019
publisher Frontiers Media S.A.
record_format MEDLINE/PubMed
spelling pubmed-6879555 2019-12-10 Luo, Xiao; Chi, Weilai; Deng, Minghua. Front Genet. Genetics. Frontiers Media S.A. 2019-11-20. /pmc/articles/PMC6879555/ /pubmed/31824562 http://dx.doi.org/10.3389/fgene.2019.01145 Text en. Copyright © 2019 Luo, Chi and Deng. http://creativecommons.org/licenses/by/4.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.
title Deepprune: Learning Efficient and Interpretable Convolutional Networks Through Weight Pruning for Predicting DNA-Protein Binding
topic Genetics
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6879555/
https://www.ncbi.nlm.nih.gov/pubmed/31824562
http://dx.doi.org/10.3389/fgene.2019.01145