Cargando…

Riboflow: Using Deep Learning to Classify Riboswitches With ∼99% Accuracy

Riboswitches are cis-regulatory genetic elements that use an aptamer to control gene expression. Specificity to cognate ligand and diversity of such ligands have expanded the functional repertoire of riboswitches to mediate mounting apt responses to sudden metabolic demands and signal changes in env...

Descripción completa

Detalles Bibliográficos
Autores principales: Premkumar, Keshav Aditya R., Bharanikumar, Ramit, Palaniappan, Ashok
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Frontiers Media S.A. 2020
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7371854/
https://www.ncbi.nlm.nih.gov/pubmed/32760712
http://dx.doi.org/10.3389/fbioe.2020.00808
_version_ 1783561190984646656
author Premkumar, Keshav Aditya R.
Bharanikumar, Ramit
Palaniappan, Ashok
author_facet Premkumar, Keshav Aditya R.
Bharanikumar, Ramit
Palaniappan, Ashok
author_sort Premkumar, Keshav Aditya R.
collection PubMed
description Riboswitches are cis-regulatory genetic elements that use an aptamer to control gene expression. Specificity to cognate ligand and diversity of such ligands have expanded the functional repertoire of riboswitches to mediate mounting apt responses to sudden metabolic demands and signal changes in environmental conditions. Given their critical role in microbial life, riboswitch characterisation remains a challenging computational problem. Here we have addressed the issue with advanced deep learning frameworks, namely convolutional neural networks (CNN), and bidirectional recurrent neural networks (RNN) with Long Short-Term Memory (LSTM). Using a comprehensive dataset of 32 ligand classes and a stratified train-validate-test approach, we demonstrated the accurate performance of both the deep learning models (CNN and RNN) relative to conventional hyperparameter-optimized machine learning classifiers on all key performance metrics, including the ROC curve analysis. In particular, the bidirectional LSTM RNN emerged as the best-performing learning method for identifying the ligand-specificity of riboswitches with an accuracy >0.99 and macro-averaged F-score of 0.96. An additional attraction is that the deep learning models do not require prior feature engineering. A dynamic update functionality is built into the models to factor for the constant discovery of new riboswitches, and extend the predictive modeling to new classes. Our work would enable the design of genetic circuits with custom-tuned riboswitch aptamers that would effect precise translational control in synthetic biology. The associated software is available as an open-source Python package and standalone resource for use in genome annotation, synthetic biology, and biotechnology workflows.
format Online
Article
Text
id pubmed-7371854
institution National Center for Biotechnology Information
language English
publishDate 2020
publisher Frontiers Media S.A.
record_format MEDLINE/PubMed
spelling pubmed-73718542020-08-04 Riboflow: Using Deep Learning to Classify Riboswitches With ∼99% Accuracy Premkumar, Keshav Aditya R. Bharanikumar, Ramit Palaniappan, Ashok Front Bioeng Biotechnol Bioengineering and Biotechnology Riboswitches are cis-regulatory genetic elements that use an aptamer to control gene expression. Specificity to cognate ligand and diversity of such ligands have expanded the functional repertoire of riboswitches to mediate mounting apt responses to sudden metabolic demands and signal changes in environmental conditions. Given their critical role in microbial life, riboswitch characterisation remains a challenging computational problem. Here we have addressed the issue with advanced deep learning frameworks, namely convolutional neural networks (CNN), and bidirectional recurrent neural networks (RNN) with Long Short-Term Memory (LSTM). Using a comprehensive dataset of 32 ligand classes and a stratified train-validate-test approach, we demonstrated the accurate performance of both the deep learning models (CNN and RNN) relative to conventional hyperparameter-optimized machine learning classifiers on all key performance metrics, including the ROC curve analysis. In particular, the bidirectional LSTM RNN emerged as the best-performing learning method for identifying the ligand-specificity of riboswitches with an accuracy >0.99 and macro-averaged F-score of 0.96. An additional attraction is that the deep learning models do not require prior feature engineering. A dynamic update functionality is built into the models to factor for the constant discovery of new riboswitches, and extend the predictive modeling to new classes. Our work would enable the design of genetic circuits with custom-tuned riboswitch aptamers that would effect precise translational control in synthetic biology. The associated software is available as an open-source Python package and standalone resource for use in genome annotation, synthetic biology, and biotechnology workflows. Frontiers Media S.A. 2020-07-14 /pmc/articles/PMC7371854/ /pubmed/32760712 http://dx.doi.org/10.3389/fbioe.2020.00808 Text en Copyright © 2020 Premkumar, Bharanikumar and Palaniappan. http://creativecommons.org/licenses/by/4.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.
spellingShingle Bioengineering and Biotechnology
Premkumar, Keshav Aditya R.
Bharanikumar, Ramit
Palaniappan, Ashok
Riboflow: Using Deep Learning to Classify Riboswitches With ∼99% Accuracy
title Riboflow: Using Deep Learning to Classify Riboswitches With ∼99% Accuracy
title_full Riboflow: Using Deep Learning to Classify Riboswitches With ∼99% Accuracy
title_fullStr Riboflow: Using Deep Learning to Classify Riboswitches With ∼99% Accuracy
title_full_unstemmed Riboflow: Using Deep Learning to Classify Riboswitches With ∼99% Accuracy
title_short Riboflow: Using Deep Learning to Classify Riboswitches With ∼99% Accuracy
title_sort riboflow: using deep learning to classify riboswitches with ∼99% accuracy
topic Bioengineering and Biotechnology
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7371854/
https://www.ncbi.nlm.nih.gov/pubmed/32760712
http://dx.doi.org/10.3389/fbioe.2020.00808
work_keys_str_mv AT premkumarkeshavadityar riboflowusingdeeplearningtoclassifyriboswitcheswith99accuracy
AT bharanikumarramit riboflowusingdeeplearningtoclassifyriboswitcheswith99accuracy
AT palaniappanashok riboflowusingdeeplearningtoclassifyriboswitcheswith99accuracy