Cargando…

Comprehensive evaluation of deep learning architectures for prediction of DNA/RNA sequence binding specificities

MOTIVATION: Deep learning architectures have recently demonstrated their power in predicting DNA- and RNA-binding specificity. Existing methods fall into three classes: Some are based on convolutional neural networks (CNNs), others use recurrent neural networks (RNNs) and others rely on hybrid archi...

Descripción completa

Detalles Bibliográficos
Autores principales: Trabelsi, Ameni, Chaabane, Mohamed, Ben-Hur, Asa
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Oxford University Press 2019
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6612801/
https://www.ncbi.nlm.nih.gov/pubmed/31510640
http://dx.doi.org/10.1093/bioinformatics/btz339
_version_ 1783432940077711360
author Trabelsi, Ameni
Chaabane, Mohamed
Ben-Hur, Asa
author_facet Trabelsi, Ameni
Chaabane, Mohamed
Ben-Hur, Asa
author_sort Trabelsi, Ameni
collection PubMed
description MOTIVATION: Deep learning architectures have recently demonstrated their power in predicting DNA- and RNA-binding specificity. Existing methods fall into three classes: Some are based on convolutional neural networks (CNNs), others use recurrent neural networks (RNNs) and others rely on hybrid architectures combining CNNs and RNNs. However, based on existing studies the relative merit of the various architectures remains unclear. RESULTS: In this study we present a systematic exploration of deep learning architectures for predicting DNA- and RNA-binding specificity. For this purpose, we present deepRAM, an end-to-end deep learning tool that provides an implementation of a wide selection of architectures; its fully automatic model selection procedure allows us to perform a fair and unbiased comparison of deep learning architectures. We find that deeper more complex architectures provide a clear advantage with sufficient training data, and that hybrid CNN/RNN architectures outperform other methods in terms of accuracy. Our work provides guidelines that can assist the practitioner in choosing an appropriate network architecture, and provides insight on the difference between the models learned by convolutional and recurrent networks. In particular, we find that although recurrent networks improve model accuracy, this comes at the expense of a loss in the interpretability of the features learned by the model. AVAILABILITY AND IMPLEMENTATION: The source code for deepRAM is available at https://github.com/MedChaabane/deepRAM. SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.
format Online
Article
Text
id pubmed-6612801
institution National Center for Biotechnology Information
language English
publishDate 2019
publisher Oxford University Press
record_format MEDLINE/PubMed
spelling pubmed-66128012019-07-12 Comprehensive evaluation of deep learning architectures for prediction of DNA/RNA sequence binding specificities Trabelsi, Ameni Chaabane, Mohamed Ben-Hur, Asa Bioinformatics Ismb/Eccb 2019 Conference Proceedings MOTIVATION: Deep learning architectures have recently demonstrated their power in predicting DNA- and RNA-binding specificity. Existing methods fall into three classes: Some are based on convolutional neural networks (CNNs), others use recurrent neural networks (RNNs) and others rely on hybrid architectures combining CNNs and RNNs. However, based on existing studies the relative merit of the various architectures remains unclear. RESULTS: In this study we present a systematic exploration of deep learning architectures for predicting DNA- and RNA-binding specificity. For this purpose, we present deepRAM, an end-to-end deep learning tool that provides an implementation of a wide selection of architectures; its fully automatic model selection procedure allows us to perform a fair and unbiased comparison of deep learning architectures. We find that deeper more complex architectures provide a clear advantage with sufficient training data, and that hybrid CNN/RNN architectures outperform other methods in terms of accuracy. Our work provides guidelines that can assist the practitioner in choosing an appropriate network architecture, and provides insight on the difference between the models learned by convolutional and recurrent networks. In particular, we find that although recurrent networks improve model accuracy, this comes at the expense of a loss in the interpretability of the features learned by the model. AVAILABILITY AND IMPLEMENTATION: The source code for deepRAM is available at https://github.com/MedChaabane/deepRAM. SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online. Oxford University Press 2019-07 2019-07-05 /pmc/articles/PMC6612801/ /pubmed/31510640 http://dx.doi.org/10.1093/bioinformatics/btz339 Text en © The Author(s) 2019. Published by Oxford University Press. http://creativecommons.org/licenses/by-nc/4.0/ This is an Open Access article distributed under the terms of the Creative Commons Attribution Non-Commercial License (http://creativecommons.org/licenses/by-nc/4.0/), which permits non-commercial re-use, distribution, and reproduction in any medium, provided the original work is properly cited. For commercial re-use, please contact journals.permissions@oup.com
spellingShingle Ismb/Eccb 2019 Conference Proceedings
Trabelsi, Ameni
Chaabane, Mohamed
Ben-Hur, Asa
Comprehensive evaluation of deep learning architectures for prediction of DNA/RNA sequence binding specificities
title Comprehensive evaluation of deep learning architectures for prediction of DNA/RNA sequence binding specificities
title_full Comprehensive evaluation of deep learning architectures for prediction of DNA/RNA sequence binding specificities
title_fullStr Comprehensive evaluation of deep learning architectures for prediction of DNA/RNA sequence binding specificities
title_full_unstemmed Comprehensive evaluation of deep learning architectures for prediction of DNA/RNA sequence binding specificities
title_short Comprehensive evaluation of deep learning architectures for prediction of DNA/RNA sequence binding specificities
title_sort comprehensive evaluation of deep learning architectures for prediction of dna/rna sequence binding specificities
topic Ismb/Eccb 2019 Conference Proceedings
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6612801/
https://www.ncbi.nlm.nih.gov/pubmed/31510640
http://dx.doi.org/10.1093/bioinformatics/btz339
work_keys_str_mv AT trabelsiameni comprehensiveevaluationofdeeplearningarchitecturesforpredictionofdnarnasequencebindingspecificities
AT chaabanemohamed comprehensiveevaluationofdeeplearningarchitecturesforpredictionofdnarnasequencebindingspecificities
AT benhurasa comprehensiveevaluationofdeeplearningarchitecturesforpredictionofdnarnasequencebindingspecificities