Cargando…

An improved method for identification of small non-coding RNAs in bacteria using support vector machine

Bacterial small non-coding RNAs (sRNAs) are not translated into proteins, but act as functional RNAs. They are involved in diverse biological processes like virulence, stress response and quorum sensing. Several high-throughput techniques have enabled identification of sRNAs in bacteria, but experim...

Descripción completa

Detalles Bibliográficos
Autores principales: Barman, Ranjan Kumar, Mukhopadhyay, Anirban, Das, Santasabuj
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Nature Publishing Group 2017
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5382675/
https://www.ncbi.nlm.nih.gov/pubmed/28383059
http://dx.doi.org/10.1038/srep46070
_version_ 1782520145638326272
author Barman, Ranjan Kumar
Mukhopadhyay, Anirban
Das, Santasabuj
author_facet Barman, Ranjan Kumar
Mukhopadhyay, Anirban
Das, Santasabuj
author_sort Barman, Ranjan Kumar
collection PubMed
description Bacterial small non-coding RNAs (sRNAs) are not translated into proteins, but act as functional RNAs. They are involved in diverse biological processes like virulence, stress response and quorum sensing. Several high-throughput techniques have enabled identification of sRNAs in bacteria, but experimental detection remains a challenge and grossly incomplete for most species. Thus, there is a need to develop computational tools to predict bacterial sRNAs. Here, we propose a computational method to identify sRNAs in bacteria using support vector machine (SVM) classifier. The primary sequence and secondary structure features of experimentally-validated sRNAs of Salmonella Typhimurium LT2 (SLT2) was used to build the optimal SVM model. We found that a tri-nucleotide composition feature of sRNAs achieved an accuracy of 88.35% for SLT2. We validated the SVM model also on the experimentally-detected sRNAs of E. coli and Salmonella Typhi. The proposed model had robustly attained an accuracy of 81.25% and 88.82% for E. coli K-12 and S. Typhi Ty2, respectively. We confirmed that this method significantly improved the identification of sRNAs in bacteria. Furthermore, we used a sliding window-based method and identified sRNAs from complete genomes of SLT2, S. Typhi Ty2 and E. coli K-12 with sensitivities of 89.09%, 83.33% and 67.39%, respectively.
format Online
Article
Text
id pubmed-5382675
institution National Center for Biotechnology Information
language English
publishDate 2017
publisher Nature Publishing Group
record_format MEDLINE/PubMed
spelling pubmed-53826752017-04-11 An improved method for identification of small non-coding RNAs in bacteria using support vector machine Barman, Ranjan Kumar Mukhopadhyay, Anirban Das, Santasabuj Sci Rep Article Bacterial small non-coding RNAs (sRNAs) are not translated into proteins, but act as functional RNAs. They are involved in diverse biological processes like virulence, stress response and quorum sensing. Several high-throughput techniques have enabled identification of sRNAs in bacteria, but experimental detection remains a challenge and grossly incomplete for most species. Thus, there is a need to develop computational tools to predict bacterial sRNAs. Here, we propose a computational method to identify sRNAs in bacteria using support vector machine (SVM) classifier. The primary sequence and secondary structure features of experimentally-validated sRNAs of Salmonella Typhimurium LT2 (SLT2) was used to build the optimal SVM model. We found that a tri-nucleotide composition feature of sRNAs achieved an accuracy of 88.35% for SLT2. We validated the SVM model also on the experimentally-detected sRNAs of E. coli and Salmonella Typhi. The proposed model had robustly attained an accuracy of 81.25% and 88.82% for E. coli K-12 and S. Typhi Ty2, respectively. We confirmed that this method significantly improved the identification of sRNAs in bacteria. Furthermore, we used a sliding window-based method and identified sRNAs from complete genomes of SLT2, S. Typhi Ty2 and E. coli K-12 with sensitivities of 89.09%, 83.33% and 67.39%, respectively. Nature Publishing Group 2017-04-06 /pmc/articles/PMC5382675/ /pubmed/28383059 http://dx.doi.org/10.1038/srep46070 Text en Copyright © 2017, The Author(s) http://creativecommons.org/licenses/by/4.0/ This work is licensed under a Creative Commons Attribution 4.0 International License. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in the credit line; if the material is not included under the Creative Commons license, users will need to obtain permission from the license holder to reproduce the material. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/
spellingShingle Article
Barman, Ranjan Kumar
Mukhopadhyay, Anirban
Das, Santasabuj
An improved method for identification of small non-coding RNAs in bacteria using support vector machine
title An improved method for identification of small non-coding RNAs in bacteria using support vector machine
title_full An improved method for identification of small non-coding RNAs in bacteria using support vector machine
title_fullStr An improved method for identification of small non-coding RNAs in bacteria using support vector machine
title_full_unstemmed An improved method for identification of small non-coding RNAs in bacteria using support vector machine
title_short An improved method for identification of small non-coding RNAs in bacteria using support vector machine
title_sort improved method for identification of small non-coding rnas in bacteria using support vector machine
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5382675/
https://www.ncbi.nlm.nih.gov/pubmed/28383059
http://dx.doi.org/10.1038/srep46070
work_keys_str_mv AT barmanranjankumar animprovedmethodforidentificationofsmallnoncodingrnasinbacteriausingsupportvectormachine
AT mukhopadhyayanirban animprovedmethodforidentificationofsmallnoncodingrnasinbacteriausingsupportvectormachine
AT dassantasabuj animprovedmethodforidentificationofsmallnoncodingrnasinbacteriausingsupportvectormachine
AT barmanranjankumar improvedmethodforidentificationofsmallnoncodingrnasinbacteriausingsupportvectormachine
AT mukhopadhyayanirban improvedmethodforidentificationofsmallnoncodingrnasinbacteriausingsupportvectormachine
AT dassantasabuj improvedmethodforidentificationofsmallnoncodingrnasinbacteriausingsupportvectormachine