Cargando…
An improved method for identification of small non-coding RNAs in bacteria using support vector machine
Bacterial small non-coding RNAs (sRNAs) are not translated into proteins, but act as functional RNAs. They are involved in diverse biological processes like virulence, stress response and quorum sensing. Several high-throughput techniques have enabled identification of sRNAs in bacteria, but experim...
Autores principales: | , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Nature Publishing Group
2017
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5382675/ https://www.ncbi.nlm.nih.gov/pubmed/28383059 http://dx.doi.org/10.1038/srep46070 |
_version_ | 1782520145638326272 |
---|---|
author | Barman, Ranjan Kumar Mukhopadhyay, Anirban Das, Santasabuj |
author_facet | Barman, Ranjan Kumar Mukhopadhyay, Anirban Das, Santasabuj |
author_sort | Barman, Ranjan Kumar |
collection | PubMed |
description | Bacterial small non-coding RNAs (sRNAs) are not translated into proteins, but act as functional RNAs. They are involved in diverse biological processes like virulence, stress response and quorum sensing. Several high-throughput techniques have enabled identification of sRNAs in bacteria, but experimental detection remains a challenge and grossly incomplete for most species. Thus, there is a need to develop computational tools to predict bacterial sRNAs. Here, we propose a computational method to identify sRNAs in bacteria using support vector machine (SVM) classifier. The primary sequence and secondary structure features of experimentally-validated sRNAs of Salmonella Typhimurium LT2 (SLT2) was used to build the optimal SVM model. We found that a tri-nucleotide composition feature of sRNAs achieved an accuracy of 88.35% for SLT2. We validated the SVM model also on the experimentally-detected sRNAs of E. coli and Salmonella Typhi. The proposed model had robustly attained an accuracy of 81.25% and 88.82% for E. coli K-12 and S. Typhi Ty2, respectively. We confirmed that this method significantly improved the identification of sRNAs in bacteria. Furthermore, we used a sliding window-based method and identified sRNAs from complete genomes of SLT2, S. Typhi Ty2 and E. coli K-12 with sensitivities of 89.09%, 83.33% and 67.39%, respectively. |
format | Online Article Text |
id | pubmed-5382675 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2017 |
publisher | Nature Publishing Group |
record_format | MEDLINE/PubMed |
spelling | pubmed-53826752017-04-11 An improved method for identification of small non-coding RNAs in bacteria using support vector machine Barman, Ranjan Kumar Mukhopadhyay, Anirban Das, Santasabuj Sci Rep Article Bacterial small non-coding RNAs (sRNAs) are not translated into proteins, but act as functional RNAs. They are involved in diverse biological processes like virulence, stress response and quorum sensing. Several high-throughput techniques have enabled identification of sRNAs in bacteria, but experimental detection remains a challenge and grossly incomplete for most species. Thus, there is a need to develop computational tools to predict bacterial sRNAs. Here, we propose a computational method to identify sRNAs in bacteria using support vector machine (SVM) classifier. The primary sequence and secondary structure features of experimentally-validated sRNAs of Salmonella Typhimurium LT2 (SLT2) was used to build the optimal SVM model. We found that a tri-nucleotide composition feature of sRNAs achieved an accuracy of 88.35% for SLT2. We validated the SVM model also on the experimentally-detected sRNAs of E. coli and Salmonella Typhi. The proposed model had robustly attained an accuracy of 81.25% and 88.82% for E. coli K-12 and S. Typhi Ty2, respectively. We confirmed that this method significantly improved the identification of sRNAs in bacteria. Furthermore, we used a sliding window-based method and identified sRNAs from complete genomes of SLT2, S. Typhi Ty2 and E. coli K-12 with sensitivities of 89.09%, 83.33% and 67.39%, respectively. Nature Publishing Group 2017-04-06 /pmc/articles/PMC5382675/ /pubmed/28383059 http://dx.doi.org/10.1038/srep46070 Text en Copyright © 2017, The Author(s) http://creativecommons.org/licenses/by/4.0/ This work is licensed under a Creative Commons Attribution 4.0 International License. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in the credit line; if the material is not included under the Creative Commons license, users will need to obtain permission from the license holder to reproduce the material. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/ |
spellingShingle | Article Barman, Ranjan Kumar Mukhopadhyay, Anirban Das, Santasabuj An improved method for identification of small non-coding RNAs in bacteria using support vector machine |
title | An improved method for identification of small non-coding RNAs in bacteria using support vector machine |
title_full | An improved method for identification of small non-coding RNAs in bacteria using support vector machine |
title_fullStr | An improved method for identification of small non-coding RNAs in bacteria using support vector machine |
title_full_unstemmed | An improved method for identification of small non-coding RNAs in bacteria using support vector machine |
title_short | An improved method for identification of small non-coding RNAs in bacteria using support vector machine |
title_sort | improved method for identification of small non-coding rnas in bacteria using support vector machine |
topic | Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5382675/ https://www.ncbi.nlm.nih.gov/pubmed/28383059 http://dx.doi.org/10.1038/srep46070 |
work_keys_str_mv | AT barmanranjankumar animprovedmethodforidentificationofsmallnoncodingrnasinbacteriausingsupportvectormachine AT mukhopadhyayanirban animprovedmethodforidentificationofsmallnoncodingrnasinbacteriausingsupportvectormachine AT dassantasabuj animprovedmethodforidentificationofsmallnoncodingrnasinbacteriausingsupportvectormachine AT barmanranjankumar improvedmethodforidentificationofsmallnoncodingrnasinbacteriausingsupportvectormachine AT mukhopadhyayanirban improvedmethodforidentificationofsmallnoncodingrnasinbacteriausingsupportvectormachine AT dassantasabuj improvedmethodforidentificationofsmallnoncodingrnasinbacteriausingsupportvectormachine |