Cargando…

GLASSgo – Automated and Reliable Detection of sRNA Homologs From a Single Input Sequence

Bacterial small RNAs (sRNAs) are important post-transcriptional regulators of gene expression. The functional and evolutionary characterization of sRNAs requires the identification of homologs, which is frequently challenging due to their heterogeneity, short length and partly, little sequence conse...

Descripción completa

Detalles Bibliográficos
Autores principales: Lott, Steffen C., Schäfer, Richard A., Mann, Martin, Backofen, Rolf, Hess, Wolfgang R., Voß, Björn, Georg, Jens
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Frontiers Media S.A. 2018
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5913331/
https://www.ncbi.nlm.nih.gov/pubmed/29719549
http://dx.doi.org/10.3389/fgene.2018.00124
_version_ 1783316520310407168
author Lott, Steffen C.
Schäfer, Richard A.
Mann, Martin
Backofen, Rolf
Hess, Wolfgang R.
Voß, Björn
Georg, Jens
author_facet Lott, Steffen C.
Schäfer, Richard A.
Mann, Martin
Backofen, Rolf
Hess, Wolfgang R.
Voß, Björn
Georg, Jens
author_sort Lott, Steffen C.
collection PubMed
description Bacterial small RNAs (sRNAs) are important post-transcriptional regulators of gene expression. The functional and evolutionary characterization of sRNAs requires the identification of homologs, which is frequently challenging due to their heterogeneity, short length and partly, little sequence conservation. We developed the GLobal Automatic Small RNA Search go (GLASSgo) algorithm to identify sRNA homologs in complex genomic databases starting from a single sequence. GLASSgo combines an iterative BLAST strategy with pairwise identity filtering and a graph-based clustering method that utilizes RNA secondary structure information. We tested the specificity, sensitivity and runtime of GLASSgo, BLAST and the combination RNAlien/cmsearch in a typical use case scenario on 40 bacterial sRNA families. The sensitivity of the tested methods was similar, while the specificity of GLASSgo and RNAlien/cmsearch was significantly higher than that of BLAST. GLASSgo was on average ∼87 times faster than RNAlien/cmsearch, and only ∼7.5 times slower than BLAST, which shows that GLASSgo optimizes the trade-off between speed and accuracy in the task of finding sRNA homologs. GLASSgo is fully automated, whereas BLAST often recovers only parts of homologs and RNAlien/cmsearch requires extensive additional bioinformatic work to get a comprehensive set of homologs. GLASSgo is available as an easy-to-use web server to find homologous sRNAs in large databases.
format Online
Article
Text
id pubmed-5913331
institution National Center for Biotechnology Information
language English
publishDate 2018
publisher Frontiers Media S.A.
record_format MEDLINE/PubMed
spelling pubmed-59133312018-05-01 GLASSgo – Automated and Reliable Detection of sRNA Homologs From a Single Input Sequence Lott, Steffen C. Schäfer, Richard A. Mann, Martin Backofen, Rolf Hess, Wolfgang R. Voß, Björn Georg, Jens Front Genet Genetics Bacterial small RNAs (sRNAs) are important post-transcriptional regulators of gene expression. The functional and evolutionary characterization of sRNAs requires the identification of homologs, which is frequently challenging due to their heterogeneity, short length and partly, little sequence conservation. We developed the GLobal Automatic Small RNA Search go (GLASSgo) algorithm to identify sRNA homologs in complex genomic databases starting from a single sequence. GLASSgo combines an iterative BLAST strategy with pairwise identity filtering and a graph-based clustering method that utilizes RNA secondary structure information. We tested the specificity, sensitivity and runtime of GLASSgo, BLAST and the combination RNAlien/cmsearch in a typical use case scenario on 40 bacterial sRNA families. The sensitivity of the tested methods was similar, while the specificity of GLASSgo and RNAlien/cmsearch was significantly higher than that of BLAST. GLASSgo was on average ∼87 times faster than RNAlien/cmsearch, and only ∼7.5 times slower than BLAST, which shows that GLASSgo optimizes the trade-off between speed and accuracy in the task of finding sRNA homologs. GLASSgo is fully automated, whereas BLAST often recovers only parts of homologs and RNAlien/cmsearch requires extensive additional bioinformatic work to get a comprehensive set of homologs. GLASSgo is available as an easy-to-use web server to find homologous sRNAs in large databases. Frontiers Media S.A. 2018-04-17 /pmc/articles/PMC5913331/ /pubmed/29719549 http://dx.doi.org/10.3389/fgene.2018.00124 Text en Copyright © 2018 Lott, Schäfer, Mann, Backofen, Hess, Voß and Georg. http://creativecommons.org/licenses/by/4.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.
spellingShingle Genetics
Lott, Steffen C.
Schäfer, Richard A.
Mann, Martin
Backofen, Rolf
Hess, Wolfgang R.
Voß, Björn
Georg, Jens
GLASSgo – Automated and Reliable Detection of sRNA Homologs From a Single Input Sequence
title GLASSgo – Automated and Reliable Detection of sRNA Homologs From a Single Input Sequence
title_full GLASSgo – Automated and Reliable Detection of sRNA Homologs From a Single Input Sequence
title_fullStr GLASSgo – Automated and Reliable Detection of sRNA Homologs From a Single Input Sequence
title_full_unstemmed GLASSgo – Automated and Reliable Detection of sRNA Homologs From a Single Input Sequence
title_short GLASSgo – Automated and Reliable Detection of sRNA Homologs From a Single Input Sequence
title_sort glassgo – automated and reliable detection of srna homologs from a single input sequence
topic Genetics
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5913331/
https://www.ncbi.nlm.nih.gov/pubmed/29719549
http://dx.doi.org/10.3389/fgene.2018.00124
work_keys_str_mv AT lottsteffenc glassgoautomatedandreliabledetectionofsrnahomologsfromasingleinputsequence
AT schaferricharda glassgoautomatedandreliabledetectionofsrnahomologsfromasingleinputsequence
AT mannmartin glassgoautomatedandreliabledetectionofsrnahomologsfromasingleinputsequence
AT backofenrolf glassgoautomatedandreliabledetectionofsrnahomologsfromasingleinputsequence
AT hesswolfgangr glassgoautomatedandreliabledetectionofsrnahomologsfromasingleinputsequence
AT voßbjorn glassgoautomatedandreliabledetectionofsrnahomologsfromasingleinputsequence
AT georgjens glassgoautomatedandreliabledetectionofsrnahomologsfromasingleinputsequence