Cargando…

Expectation pooling: an effective and interpretable pooling method for predicting DNA–protein binding

MOTIVATION: Convolutional neural networks (CNNs) have outperformed conventional methods in modeling the sequence specificity of DNA–protein binding. While previous studies have built a connection between CNNs and probabilistic models, simple models of CNNs cannot achieve sufficient accuracy on this...

Descripción completa

Detalles Bibliográficos
Autores principales: Luo, Xiao, Tu, Xinming, Ding, Yang, Gao, Ge, Deng, Minghua
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Oxford University Press 2020
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7703793/
https://www.ncbi.nlm.nih.gov/pubmed/31598637
http://dx.doi.org/10.1093/bioinformatics/btz768
_version_ 1783616697859571712
author Luo, Xiao
Tu, Xinming
Ding, Yang
Gao, Ge
Deng, Minghua
author_facet Luo, Xiao
Tu, Xinming
Ding, Yang
Gao, Ge
Deng, Minghua
author_sort Luo, Xiao
collection PubMed
description MOTIVATION: Convolutional neural networks (CNNs) have outperformed conventional methods in modeling the sequence specificity of DNA–protein binding. While previous studies have built a connection between CNNs and probabilistic models, simple models of CNNs cannot achieve sufficient accuracy on this problem. Recently, some methods of neural networks have increased performance using complex neural networks whose results cannot be directly interpreted. However, it is difficult to combine probabilistic models and CNNs effectively to improve DNA–protein binding predictions. RESULTS: In this article, we present a novel global pooling method: expectation pooling for predicting DNA–protein binding. Our pooling method stems naturally from the expectation maximization algorithm, and its benefits can be interpreted both statistically and via deep learning theory. Through experiments, we demonstrate that our pooling method improves the prediction performance DNA–protein binding. Our interpretable pooling method combines probabilistic ideas with global pooling by taking the expectations of inputs without increasing the number of parameters. We also analyze the hyperparameters in our method and propose optional structures to help fit different datasets. We explore how to effectively utilize these novel pooling methods and show that combining statistical methods with deep learning is highly beneficial, which is promising and meaningful for future studies in this field. AVAILABILITY AND IMPLEMENTATION: All code is public in https://github.com/gao-lab/ePooling. SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.
format Online
Article
Text
id pubmed-7703793
institution National Center for Biotechnology Information
language English
publishDate 2020
publisher Oxford University Press
record_format MEDLINE/PubMed
spelling pubmed-77037932020-12-07 Expectation pooling: an effective and interpretable pooling method for predicting DNA–protein binding Luo, Xiao Tu, Xinming Ding, Yang Gao, Ge Deng, Minghua Bioinformatics Original Papers MOTIVATION: Convolutional neural networks (CNNs) have outperformed conventional methods in modeling the sequence specificity of DNA–protein binding. While previous studies have built a connection between CNNs and probabilistic models, simple models of CNNs cannot achieve sufficient accuracy on this problem. Recently, some methods of neural networks have increased performance using complex neural networks whose results cannot be directly interpreted. However, it is difficult to combine probabilistic models and CNNs effectively to improve DNA–protein binding predictions. RESULTS: In this article, we present a novel global pooling method: expectation pooling for predicting DNA–protein binding. Our pooling method stems naturally from the expectation maximization algorithm, and its benefits can be interpreted both statistically and via deep learning theory. Through experiments, we demonstrate that our pooling method improves the prediction performance DNA–protein binding. Our interpretable pooling method combines probabilistic ideas with global pooling by taking the expectations of inputs without increasing the number of parameters. We also analyze the hyperparameters in our method and propose optional structures to help fit different datasets. We explore how to effectively utilize these novel pooling methods and show that combining statistical methods with deep learning is highly beneficial, which is promising and meaningful for future studies in this field. AVAILABILITY AND IMPLEMENTATION: All code is public in https://github.com/gao-lab/ePooling. SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online. Oxford University Press 2020-03 2019-10-09 /pmc/articles/PMC7703793/ /pubmed/31598637 http://dx.doi.org/10.1093/bioinformatics/btz768 Text en © The Author(s) 2019. Published by Oxford University Press. http://creativecommons.org/licenses/by-nc/4.0/ This is an Open Access article distributed under the terms of the Creative Commons Attribution Non-Commercial License (http://creativecommons.org/licenses/by-nc/4.0/), which permits non-commercial re-use, distribution, and reproduction in any medium, provided the original work is properly cited. For commercial re-use, please contact journals.permissions@oup.com
spellingShingle Original Papers
Luo, Xiao
Tu, Xinming
Ding, Yang
Gao, Ge
Deng, Minghua
Expectation pooling: an effective and interpretable pooling method for predicting DNA–protein binding
title Expectation pooling: an effective and interpretable pooling method for predicting DNA–protein binding
title_full Expectation pooling: an effective and interpretable pooling method for predicting DNA–protein binding
title_fullStr Expectation pooling: an effective and interpretable pooling method for predicting DNA–protein binding
title_full_unstemmed Expectation pooling: an effective and interpretable pooling method for predicting DNA–protein binding
title_short Expectation pooling: an effective and interpretable pooling method for predicting DNA–protein binding
title_sort expectation pooling: an effective and interpretable pooling method for predicting dna–protein binding
topic Original Papers
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7703793/
https://www.ncbi.nlm.nih.gov/pubmed/31598637
http://dx.doi.org/10.1093/bioinformatics/btz768
work_keys_str_mv AT luoxiao expectationpoolinganeffectiveandinterpretablepoolingmethodforpredictingdnaproteinbinding
AT tuxinming expectationpoolinganeffectiveandinterpretablepoolingmethodforpredictingdnaproteinbinding
AT dingyang expectationpoolinganeffectiveandinterpretablepoolingmethodforpredictingdnaproteinbinding
AT gaoge expectationpoolinganeffectiveandinterpretablepoolingmethodforpredictingdnaproteinbinding
AT dengminghua expectationpoolinganeffectiveandinterpretablepoolingmethodforpredictingdnaproteinbinding